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Abstract. One of the major achievements of the recently emerged quantum 
| information theory is the introduction and thorough investigation of the notion of 

quantum channel which is a basic building block of any data-transmitting or data- 
processing system. This development resulted in an elaborated structural theory 
and was accompanied by the discovery of a whole spectrum of entropic quantities, 
notably the channel capacities, characterizing information-processing performance of 
the channels. 

This paper gives a survey of the main properties of quantum channels and of their 
entropic characterization, with a variety of examples for Unite dimensional quantum 
systems. We also touch upon the "continuous-variables" case, which provides an 
arena for quantum Gaussian systems. Most of the practical realizations of quantum 
information processing were implemented in such systems, in particular based on 
principles of quantum optics. Several important entropic quantities are introduced 
and used to describe the basic channel capacity formulas. The remarkable role of the 
specific quantum correlations - entanglement - as a novel communication resource, is 
stressed. 
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1. Introduction 

The concept of communication channel and its capacity is central in information 
theory [37] where probabilistic models of noise were used with great success. The 
importance of considering quantum channels is due to the fact that any physical 
communication line is after all a quantum one, and it can be treated classically only 
in so far as the quantum noise associated with the fundamental quantum-mechanical 
uncertainty [12] is negligible as compared to the classical fluctuations. This is not 
the case in many modern applications such as optical communication [33] or quantum 
system engineering [135j . calling for a genuinely quantum approach. 

Mathematically, the notion of quantum channel is related to that of dynamical 
map, which emerged in 1960- 70s and goes back to the "operations" of Haag-Kastler [70] 
and Ludwig |128j . The essential property of dynamical maps is positivity ensuring that 
states are transformed into states in the Schrodinger picture. These transformations are 
a quantum analog of the Markov maps (stochastic matrices) in probability theory and 
a natural further development was to consider semigroups of such processes as a model 
of quantum Markovian dynamics (see Kossakowski [116j . Davies [4T]). 

However, an approach based on positivity remained not extremely productive until 
it was observed independently by several researchers (e. g. Kraus [117] , Lindblad 
[122] , Gorini et al. [66], Evans and Lewis [50] in statistical mechanics, and by Holevo 
[82] in the context of quantum communication theory) that the reduced dynamics of 
open quantum systems has a stronger property of complete positivity, introduced earlier 
in the purely mathematical context by Stinespring [174] and studied in detail in the 
finite dimensional case by Choi [36]. Moreover when interpreted in the physical terms, 
the basic Stinespring's dilation theorem implies that complete positivity is not only 
necessary, but also sufficient for a dynamical map to be extendable to the unitary 
dynamics of an open quantum system interacting with an environment. In this way 
the notion of dynamical map in statistical mechanics (and that of quantum channel 
in information theory) was finally identified with that of completely positive, properly 
normalized map acting on the relevant operator space (generated either by states or 
by observables) of the underlying quantum system. A comprehensive study of the 
dynamical aspects of open quantum systems was presented by Spohn [158] . and a 
number of important entropic quantities introduced already at this early stage were 
surveyed by Wehrl [184]. 

Quantum information theory brought a new turn to the whole subject by giving a 
deeper operational insight to the notion of channels and to their entropic characteristics. 
Fundamental results of the classical information theory are the coding theorems which 
establish the possibility of transmitting and processing data reliably (i.e. asymptotically 
error-free) at rates not exceeding certain threshold values (capacities) that characterize 
the system under consideration. Coding theorems provide explicit formulas for such 
thresholds in terms of entropic functionals of the channel. From a different perspective, 
quantum information theory gives a peculiar view onto irreversible evolutions of 
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quantum systems, providing the quantitative answer to the question - to what extent 
effects of irreversibility and noise in the channel can be reversed by using intelligent pre- 
processing of the incoming and post-processing of the outgoing states. The issue of the 
information capacity of quantum communication channels arose soon after publication 
of the pioneering Shannon's paper [160J and goes back to works of Gabor [53] and 
Gordon [65J, asking for fundamental physical limits on the rate and on the quality of 
the information transmission. This laid a physical foundation and raised the question 
of consistent quantum-theoretical treatment of the problem. 

Important steps in this direction were made in the 1970-s when quantum detection 
and estimation theory was developed, making a quantum probabilistic frame for these 
problems, see the books of Helstrom [80J and Holevo [8?]. At that time the concepts 
of quantum communication channel and of its capacity for transmitting classical 
information were established, along with a fundamental upper bound for that quantity. 
From this point the subject of the present survey begins. In 1990-s a new interest on 
noisy quantum channels arose in connection with the emerging quantum information 
science, with a more detailed and deeper insight - e.g. see the books [135] [76] or the 
review papers [161 El 1172} 1107] and the references therein (further references to original 
works will be also given in the present reivew). The more recent developments are 
characterized by an emphasis on the new possibilities (rather than on the restrictions) 
implied by specifically quantum features of the information processing agent, notably - 
the entanglement as a novel communication resource. Along these lines the achievability 
of the information bound and a number of further quantum coding theorems were 
discovered making what can be described as quantum Shannon theory. It was realized 
that a quantum communication channel is characterized by a whole spectrum of 
capacities depending on the nature of the information resources and specific protocols 
used for the transmission. On the other hand, the question of information capacity 
turned out important for the theory of quantum computing, particularly in connection 
with quantum error- correcting codes, communication and algorithmic complexities, and 
quantum cryptography where the channel environment not only introduces the noise 
but also models an eavesdropper interfering with private communication. 

The aim of the present paper is to provide a survey of the principal features of 
quantum communication channels. We will discuss mathematical representations for 
quantum channels and focus on the question - how their efficiency in transferring signals 
can be characterized in terms of various entropic quantities. The material is organized 
as follows. Sec. [2] is devoted to the structural theory of quantum channels. It also gives a 
survey of the main particular classes of channels in the case of finite level quantum system 
with a variety of examples. In Sec. [3] we pass to the "continuous- variables" case, which 
provides an arena for quantum Gaussian systems. Many experimental demonstrations 
of quantum information processing were realized in such systems, in particular basing 
on principles of quantum optics. Several important entropic quantities are introduced 
in Sec. H] and used to describe the basic capacity formulas. Most of content in this 
Section is devoted to the finite level systems and can be addressed straight after reading 
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Sec. [2J In the course of our survey several important open questions are formulated and 
discussed. Summary and outlook are given in Sec. [51 



2. Channels and open systems 

2.1. Classical and quantum information carriers 

In any information-processing scheme, data are encoded into the states of some physical 
systems (e.g. the etched surface of a DVD or the EM microwaves emitted by a cellphone) 
which play the role of information carriers or fundamental data-processing elements. In 
its simplest version, the classical theory of information assigns to each of them a finite 
phase space X (called also alphabet) whose elements represent the possible configurations 
the carrier can assume (e.g. on/off, dot /dash/space in Morse code, etc.). More precisely 
the completely determined states (pure states) of a carrier are identified with the points 
x G X, while the mixed states which represent statistical ensemble of pure states, 
are described by probability distributions {p x } on X . The Shannon entropy of such 
distribution [37] 



provides a measure of the "uncertainty" or of the "lack of information" in the 
corresponding ensemble (also, as it will be clarified in Sec. 14.11 H(X) describes 
the potential information content of a random source producing symbols x with 
probabilities p x ). Choosing the binary logarithm in Eq. ([[]) means that we are measuring 
information in binary digits - bits - which is convenient because of the basic role 
of two-state processing systems. The minimal value of the entropy, equal to 0, is 
attained on pure states (here and in the remaining of the paper we adopt the usual 
convention log 2 = 0), while the maximal, equal to log 2 \ X\, is obtained on the uniform 
distribution p x = \X\~ , where \X\ denotes the size of the alphabet X. 

Such a simple phase space description does not hold when the information carrier 
used to encode the data is a quantum system (e.g. a two-level ion confined in space by 
using strong electromagnetic fields |134] or a single polarized photon propagating along 
an optical fiber [63]). Instead the latter can be represented in terms of a Hilbert space 




% [4"2| 1181] . which again for simplicity we take finite dimensional. In this context, the 
pure quantum states are described by projections onto unit vectors \ip) of H (as 

in the classical case they define the completely determined configurations of the carrier, 
which in the quantum case correspond to specific state preparation procedures). Mixed 
states are represented by statistical ensembles of pure states \ip a )(ip a \ with probabilities 
p a . Each mixed state is formally described by the corresponding density operator 
p = ^2 a p a \' l P a ){' l P a \ which contains, in a highly condensed form, the information about 
the procedure for producing such state (specifically it describes a preparation procedure 
of the carrier characterized by a stochastically fluctuating parameter a), and which has 
the following properties [181] : 




(1) 



X 
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i) p is a linear Hermitian positive operator in H; 

ii) p has unit trace, i.e. Trp = 1. 

Analogously to their classical counterparts, quantum mixed states form a convex 
set &{%) whose extremal points are represented by pure states. Unlike classical mixed 
states however, they can be written as mixture of pure states in many different ways, i.e. 
one and the same density matrix p represents different stochastic preparation procedures 
of the carrier. A distinguished ensemble is given by the spectral decomposition 
p = J2j <\? l e *'}( e ' J |) where Xj are the eigenvalues and |e J ) - orthonormal eigenvectors of 
the operator p. The eigenvalues Xj of a density operator form a probability distribution 
and the Shannon entropy of this distribution is equal to the von Neumann entropy of 
the density operator |181j . 

S(p) = -Tip log 2 p = - A,- log 2 Xj = H({Xj}) , (2) 

j 

which provides a measure of uncertainty and, as it is explained later, of the information 
content of the quantum state p. Again, the minimal value of the entropy, equal to 0, is 
attained on pure states while the maximal, equal to log 2 d, - on the chaotic state p = I/d 
(with / denoting the unit operator in % and d = dim H being the dimensionality of the 
space) . 

There is a way to formally embed a finite classical system associated with a classical 
information carrier into a quantum system by introducing the Hilbert space with the or- 
thonormal basis {\x)} indexed by phase space points x £ X. Then to the classical states 
{p x } of the classical carrier correspond the diagonal density operators p = J2xP^\ x )( x \ 
which commute with each other. Truly quantum systems, however, admit also config- 
urations which are described by density operators that are not simultaneously diago- 
nalizable (i.e. noncommuting). Indeed the full information content of a quantum state 
cannot be reduced to a classical message and therefore deserves special name quantum 
information. This is related to the fact that a quantum state contains implicitly the 
statistics of all possible quantum measurements, including mutually exclusive (com- 
plementary) ones: a distinctive feature which ultimately leads to the impossibility of 
cloning of quantum information (see Sec. 12.31 for detail). 

Example: Take a binary alphabet X = {0, 1} with associated quantum or- 
thogonal basis {|0),|1)}. The density operator |+)(+| corresponding to the vector 
1+) = (|0) + |l))/\/2) is a proper quantum state of the system which in general does 
not commute with the density matrices po|0)(0| +pi|l)(l| used to embed the statistical 
distributions {po,Pi} on X. 

In information theory, both classical and quantum, messages are typically 
transmitted by using block coding strategies that exploit long sequences of carriers. 
Therefore one systematically has to do with composite systems corresponding to 
repeated or parallel uses of communication channels. Entanglement reflects unusual 
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properties of composite quantum systems which are described by tensor rather than 
Cartesian (as in the classical case) product of the component systems. According to 
the superposition principle [32], the Hilbert space Hab = Ha ® "H_b of the composite 
system AB, along with product vectors \ipA ® ^b) = \^a) ® \^b) with \ipA) £ Ha and 
\iJjb) £ Hb-i contains all possible linear combinations \ip% <8> The pure states 
given by product vectors are called separable while their superpositions are entangled. 
A mixed state of AB is called separable if it can be expressed as a mixture of product 
states, and entangled if it can not |185l 1135] . 

Entanglement is an intrinsically "nonclassical" sort of correlation between 
subsystems which typically emerges due to quantum interactions. If a classical composite 
system AB is in a pure (i.e. completely determined) state then apparently the 
subsystems A, B are also in their uniquely defined pure partial states. Strikingly, this is 
not so for the entangled states of a quantum composite system AB. Consider a generic 
pure state \iPab) of such a system. By Schmidt decomposition it can be expressed as 

I^M =J2 V 7 ^ Wa®^b) , (3) 

3 

where {Xj} is a probability distribution which is uniquely determined by \iPab), while 
{\e J AB )} are some orthonormal bases in %a,b [135] . Then the local state of the subsystem 
A is given by the reduced density operator 

p A = Tr B \ip AB )(ip AB \ = ^2 Xj\e\) (e J A \ , (4) 

3 

obtained by taking the partial trace Tt.b[- ■ •] of the original joint state ([3]) over the 
degrees of freedom of B (similarly for the partial state of B). Thus, unless the Schmidt 
form (J3J) factorizes (i.e. the numbers Xj are all but one equal to zero), the density 
matrices of A and B represent mixed states. In particular they have the same nonzero 
eigenvalues Xj and hence equal entropies, 

S {p A ) = S{p B ) = ~^Xj log 2 Xj , (5) 

3 

which are strictly positive if the state \iPab) is a genuine superposition of product 
vectors, i.e. entangled. In fact, the entropy (jHJ) measures how much entangled is the 
state flSj); it is zero if and only if \i/jab) is separable, and takes its maximal value log 2 d 
for the maximally entangled states which have the Schmidt representation ([3]) with the 
uniform coefficients Xj = 1/ \fd (measures of entanglement can also be defined for mixed 
states of AB - we refer the reader to [104] (28J [69] for detailed reviews on that subject). 

This passage from the pure joint state of a composite system to the mixed partial 
state of one of its components can be inverted. Indeed, for any density matrix pa of a 
system A there is a pure state of a composite system AB, where B is large enough to 
contain a copy A, given by the vector (j3J) such that pa is its partial state. This pure 
state is called purification of pa- Nothing like this exists in the classical system theory. 

For the sake of completeness it is finally worth to mention that entanglement is not 
the sole way in which quantum mechanical states exhibit nonclassical features. Another 
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aspect which is currently attracting a growing interest is the possibility of exploiting 
the nonorthogonality of states, like |0) and |+) introduced before, to build correlated 
joint configurations which, even though being separable, cannot be accounted for by a 
purely classical theory. A measure for such nonclassical, yet unentangled, correlations 
is provided by the so called quantum discord, see |139l IBTj for detail. 

2.2. Classical and quantum channels 

A communication channel is any physical transmission medium (e.g. a wire) that 
allows two parties (say, the sender and the receiver) to exchange messages. In classical 
information theory [37J , a channel is abstractly modeled by specifying an input alphabet 
X and an output alphabet y (with y not necessarily the same as X). The elements x 
of X represent the input signals (or letters) the sender wishes to transfer. Alternatively 
one can think of x's as the pure (classical) states at the input of the information carrier 
that propagate through the physical medium that describes the communication line. 
Similarly the elements y of y represent the output counterparts of the input signals 
which arrive to the receiver after the propagation through the medium. The physical 
properties of the communication process, including the noise that may affect it, are then 
summarized by a conditional probability p{y\x) of receiving an output letter y 6 ^ when 
an input signal x G X is sent. Hence, if an input probability distribution P = {p x } is 
given, reflecting the frequencies of different input signals, then the input and the output 
become random variables X, Y with the joint probability distribution p xy = p(y\x)p x . 
Accordingly, the input probability distribution P = {p x } is transformed by the channel 
to the output probability distribution P' = {p' y }, where p' y = J2 x p(y\x)p x . A noiseless 
channel is such that X = y and the probabilities p{y\x) are either or 1 (i.e. it amounts 
to a permutation of the alphabet X). A particular case is the ideal (identity) channel 
Id, for which p(y\x) = 8 x>y (Kronecker's delta). 

Looking for a generalization to the quantum domain we should first consider those 
scenarios in which classical data (i.e. data which could have been stored into the state 
of a classical carrier represented by the alphabet X) are transmitted through a physical 
communication line which employs quantum carriers to convey information (e.g. an 
optical fiber operating at very low intensity of the light [331 [63]). in these configurations 
the signaling process requires an initial encoding stage in which the elements of X are 
"written" into the quantum states of the carrier, and a final decoding stage in which 
the received quantum states of the carrier are mapped back into classical data by some 
measurement, as schematically shown on the diagram of Fig. [TJ The encoding stage of 
such scheme is characterized by assigning a classical- quantum (c-q) mapping x — > p x 
that defines which density matrix p x of the quantum carrier represents the symbol x G X 
of the input classical message. Physically, x can thus be interpreted as a parameter of 
a state preparation procedure (the correspondence x — > p x containing, in a condensed 
form, the description of the physical process generating the state p x ). Notice that in 
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Figure 1. Transferring classical data x £ X via a quantum information carrier. The 
first stage of the process requires the encoding of x into a quantum state p x of the 
carrier (c-q mapping); then the carrier propagates along the communication line and 
its state gets transformed into the density matrix p' x (q-q mapping); finally there is 
the decoding stage where the receiver of the message tries to recover x by performing 
some measurement on p' x and obtaining the classical outcome y (q-c mapping). 



order to preserve the statistical structure of the process, a mixed input state of X defined 
by the probability distribution P = {p x } is mapped by the c-q channel into the density 
operator p = Y^ x PxPx- 

On the other hand, the decoding stage of Fig. [1] is characterized by a quantum- 
classical (q-c) mapping that establishes the probabilities of the output letters y G 
y corresponding to the quantum state of the carrier emerging from the quantum 
communication line. Such a mapping is implemented by a quantum measurement and 
it is characterized by assigning the probability distribution p y (p) that, given a generic 
state p of the carrier, defines the statistics of the possible measurement outcomes. It 
can be shown [87] that the linear dependence of p y (p) resulting from the preservation of 
mixtures, along with general properties of probabilities, implies the following functional 
structure 



where {M y } is a collection of Hermitian operators in % with the properties: 
My > 0,^ J/ Mj / = I. Any such collection is called probability operator-valued 
measure (POVM), or, in the modern quantum phenomenology, just observable (with 
values y) [88]. Later we shall explain how the operators M y arise from the dynamical 
description of a measurement process. An observable is called sharp if M y are mutually 
orthogonal projection operators, i.e. My = M y , M y M y i — if y ^ y'. In this case ([6]) 
amounts to the well-known "Born- von Neumann statistical postulate" [12 | 1181] . 

The c-q and q-c transformations described above are special cases of quantum 
channels with classical input, resp. output, which will be considered in more detail in 
Sec. 12.81 The central link in the scheme of Fig. [1] is what really defines the quantum 
character of the process. It describes the transformation that the state of the quantum 
carrier experiences when propagating through the communication line and it is fully 
characterized by assigning a quantum-quantum (q-q) mapping 



Py(p) 



Tr pM, 



(6) 




(7) 
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which takes input density operators p into output density operators p' according to 
the specific physical properties of the model under consideration, while respecting the 
statistical mixtures, i.e. 



E 



Pa Pa 



(8) 



for all probability distributions {p a } and for arbitrary density operators {p a }- An ex- 
act characterization of such transformations is clearly mandatory as they constitute the 
proper quantum counterparts of the stochastic mapping P — > P' of the classical com- 
munication theory. 



Example: The basic example of a quantum system is the qubit - a two-level 
quantum system (say, the spin of an electron) characterized by Hilbert H space of 
dimensionality 2. Via the specification of a canonical basis in H, its linear operators are 
represented as linear combinations of the Pauli matrices 
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(9) 



This allows us to express the density operators p G &(%) of the system as points 
a = (a x , a y , a z ) in the unit ball in the three-dimensional real vector space M. 3 (the Block 
ball) by identifying the coordinates a x ,a y , a z with the Stokes parameters of the expansion 
(in which the condition Trp = 1 is taken into account) 



p = -(I + a x a x + a y a y + a z a z 



1 + a z 



ia„ 



1 



(10) 



Equation (JSJ) then implies that a qubit channel is a linear map in R 3 transforming the 
Bloch ball into certain ellipsoid inside the ball. Up to rotations at the input and the 
output, such a transformation has the following canonical form in the basis of Pauli 
matrices: 



a 7 — > a' 7 = b 1 + a 7 t 7 , j — x,y,z, (11) 

with 6 7 ,t 7 real numbers (satisfying some further constraints to be discussed later). The 

= 1. Hence 6 7 give the coordinates 



equation of the output ellipsoid is ^ 
of the center of the output ellipsoid, while |i 7 | its half-axes. 



"f=x,y,z 



2. 3. Dynamics of isolated and open quantum systems 

The simplest case of the q-q mappings ([7]) is represented by the noiseless quantum 
channels which describe reversible transformations of the set of an isolated quantum 
states ©("H) onto itself. The famous Wigner's theorem implies [H] that any such 
mapping is implemented by either unitary or anti-unitary conjugation, which amounts to 

<f>[p\ = U plP , or = U p T U ] , (12) 

where U is a unitary operator on the system Hilbert space "H, {-y denotes the Hermi- 
tian adjoint, while (-) T is the matrix transposition in a fixed basis. For a qubit, the 
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mappings in Eq. (1121) define, respectively, rotations of the Bloch ball (orthogonal maps 
with determinant +1), and reflection with respect to xz-plane (i.e. matrix transposition 
in the canonical basis) followed by rotations (orthogonal maps with determinant — 1). 
It turns out however, that only the unitary conjugations (the first case in (fT2"]l ) can be 
associated with proper dynamical processes, the anti-unitary being excluded since they 
cannot be continuously connected with the identity mapping (the transposition p — > p T 
is indeed typically identified with time reversal). 

As it was already mentioned in Sec. 12.11 a basic feature of quantum information as 
distinct from classical is the impossibility of cloning [193j . Clearly, any classical data 
can be copied exactly in an arbitrary quantity. But a "quantum xerox", i.e. a physical 
device which would accomplish a similar task for arbitrary quantum states contradicts 
the principles of Quantum Mechanics. Indeed, the cloning transformation 

\^)eU — > eH® n = H®---®H , (13) 

V v ' 

n 

is nonlinear and cannot be implemented by a unitary operator (even if operating jointly 
on some external ancillary system that is traced away at the end of the process). Of 
course, this can be done for each given state (and even for each fixed set of orthogo- 
nal states) by a corresponding specialized device, but there is no universal doner for all 
quantum states. It should be noted also that approximate cloning of arbitrary quantum 
states is allowed, as long as the resulting transformation realizes Eq. (113"]) to a certain 
known degree of accuracy, see e.g. Ref. [151] . The quantum xerox is not the only type of 
a machine which is forbidden by the laws of Quantum Mechanics. For a comprehensive 
list of such "impossible devices" we refer the reader to Ref. [186J. 

The evolution of an open system, subject to exterior influences, whether it be the 
process of establishing equilibrium with an environment or interaction with a measuring 
apparatus, reveals features of irreversibility. These transformations constitute the 
class of noisy quantum channels (Jj[J) which cause distortion to the transmitted states 
of a quantum carrier in its propagation through the communication line. A formal 
characterization of such processes can be obtained by introducing a Hilbert space He to 
describe the environmental degrees of freedom E which tamper with the communication, 
and assigning to it an initial state pE- Since the carrier and the environment together 
form an isolated system, their joint (reversible) evolution can be described by a unitary 
operator U which, acting nontrivially on the composite Hilbert space H ® He, defines 
their interaction - see Fig. |2j Consequently the irreversible q-q mappings (GO) can be 
obtained by averaging off E from the resulting output configurations, i.e. 

p — ► %] = Tie [U (p ® Pe ) lP] • (14) 

This expression provides what is generally called a unitary representation for the 
quantum channel $ and generalizes the unitary evolutions of Eq. (fT2l) to the case of 
irreversible dynamics. Let us then assume that the initial environment state is pure, i.e. 
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=Tr e [U{ P ®Pe)U^] 



Figure 2. Schematics of the unitary representation (fH| for the quantum channel $. 



Pe = \4 ) e)(4 ) e\ - via purification of pe this is always possible: the resulting expression 
is often called the Stinespring representation of $ |174[ 1117] . In this case Eq. f lT^|) can 
be written as 

%] = Tt E VpV* , (15) 
with V being an isometric operator from % to "H <8> He defined by 

d E 

V = U\i; E )=J^V k ®\e E ) , (16) 

k=l 

where d E is the dimensionality of T-L E while {|e| ; )} fc _ 1 d is one of its orthonormal 
basis (recall that a linear operator from one space to another is isometric if it preserves 
the norms and hence inner products of vectors. In particular it satisfies the relation 
V^V = I with / being the identity operator of the input space). 

In this expression 14 = (e E \U\ip E ) are operators acting on 7-L which are uniquely 
determined by the relation 

(cp\V k \iP) = ( V ® e k E \U\i; ® 1>e) , \<p), Vl>) e n. (17) 
They are called Kraus operators and satisfy the completeness relation 

d E 

E v * Vk = 1 ■ ( 18 ) 

k=l 

Taking the partial trace in ( TT5l) in the basis {|e| ; )} fc _ 1 d we finally arrive at the so 
called operator-sum (or Kraus) decomposition |117] for the evolution (fl4|) 

m = Yy*pvl ■ as) 

k=l 

12 



It is important to notice that given a quantum channel $ expressed as in Eq. (1191) 
with some set of operators {V&}, it is always possible to extend it (in a highly non-unique 
way) to the open system dynamics (1141) with some effective environment E initialized in 
a state pe, and a unitary transformation U that couples it to the system (or equivalently 
in terms of an isometric transformation V that maps H, into T-L^I-Le as in Eq. (JT5l)). 
Furthermore, even the Kraus decomposition for a given channel $ is not unique. Indeed 
given a Kraus set {V^} for $, a new Kraus set {W^}, which represents the same quantum 
channel, can be obtained by forming the following linear combinations 

W^J^UjkVk, (20) 

k=l 

where [ujk] is any complex matrix which satisfies the isometry constraint 
^2jUk'jUjk = 5kk'- Let us stress that here, as anywhere in quantum information sci- 
ence, "environment" means those degrees of freedom of the actual physical environment 
of the open quantum system which are essentially involved in the interaction and in the 
resulting information exchange with the system (cf. the notion of "faked continuum" 
in |173j ). Therefore the question of such a minimal environment for a given channel 
arises quite naturally (clearly the Kraus decomposition with the minimal number of 
non-zero components oIe is unique only up to transformations of the form (120]) with Ujf. 
being now unitary matrices; on the contrary, the size cLe of the minimal environment is 
an important invariant of the channel $ reflecting its "noisyness" and irreversibility). In 
the minimal decomposition the Kraus operators are linearly independent, so that their 
number cannot exceed the dimensionality d 2 of the space of linear operators acting on 
the system - for instance, the reversible channels of Eq. f lT2|) have a minimal decompo- 
sition with a single Kraus element (the unitary U which defines them). 

Example 1: The depolarizing channel (with probability of error p) is given by the 
formula (in which p is an arbitrary operator, not necessarily of unit trace) 

$[p\ = (l-p)p + p I - Trp, (21) 

where dim'H = d. This relation describes mixture of the ideal channel Id : p — > p and 
of the completely depolarizing channel p — > (I/d)Trp which transforms any state p 
into the chaotic state I jd. For the depolarizing channel a Kraus decomposition can be 
obtained by writing the unit operator and the trace in a fixed basis {(e- 7 )}: 

d 

S\p] = (1 - p)V oP vS + V *P V ii ' ( 22 ) 

where Vq = I, Vij = |e*)(e J '|. Such Kraus set is clearly not minimal since its elements are 
linearly dependent (e.g. Vq = Y^t=i Va)- 

Example 2: As another example consider the qubit channel (fill) with 6 7 = 0, 
that contracts the Bloch ball along the axes 7 = x,y,z with the coefficients |t 7 | 
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(notice that qubit depolarizing channel corresponds to the uniform contraction with 
t x — t y = t z = 1 — p). By using the multiplication rules for Pauli matrices one sees that 
in this case 

$ W = Yl Pi a i P a 7> ( 23 ) 
7=0, x, y, z 

where 

p = (l + t x + t y + t z ) /A, p x = (l + t x -t y -t z ) /A , (24) 

Py =(l-t x + ty ~ t z ) /4 , Pz=(l-t X -t v + t Z ) /4 , (25) 

and the nonnegativity of these numbers is the necessary and sufficient condition for the 
complete positivity of the map $ (see below) . In this case (123]) gives the minimal Kraus 
decomposition for the channel (the size of the minimal environment being equal to the 
number of strictly positive coefficients p 7 ). 



Coming back to the decomposition ( JT9l) we see that it can be written as 

d E 

<%] = $>fc(p) Pfc , (26) 

k=l 

where Pk{p) = TrpM^ is the probability distribution associated with a POVM of 
elements M k = V k V^ and p k = V k pV^ /pk{p) are density operators. Noticing 
that VkpVl = {e k E \U {p® \i/;E)(ipE\)U j! \e%), the relation ( j26l) can then be interpreted 
as follows: after the system and the environment have evolved into the state 
U (p® \ip e) e\) via the unitary coupling U, in the environment a von Neumann 
measurement in the basis (|e^)} is performed [181] : the outcome k appears with 
probability Pk(p) and the posterior state of the system conditioned upon the outcome k 
is the density operator p k . In other words the formula ( |26|) gives the decomposition of 
the system state $[p] after the (nonselective) measurement into ensemble of posterior 
states corresponding to different measurement outcomes. 



2.4- Heisenberg picture 

So far we worked in the Schrodinger picture describing evolutions of states for fixed 
observables. The passage to the Heisenberg picture where the observables of the system 
evolve while the states are kept fixed, is obtained by introducing the dual channel $* 
according to the rule 

Tr$[p] X = Tip $*[X] , (27) 

which must hold for all density operators p G &{T-L) and for all operators X in H. The 
map $* is also completely positive admitting the operator-sum decomposition 

^■[X} = J2v^XV kj (28) 

k 
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where {V&} is a Kraus set for <£>. The property of trace preservation for $ is equivalent 
to the fact that $* is unital, i.e. it leaves invariant the unit operator J: $*[/] = /. 
A channel $ in the Schrodinger picture may also occasionally be unital, in which case 
it is called bistochastic; it leaves invariant the chaotic state I/d. This happens always 
when the channel is self-dual: $ = $*. An example of such a channel is provided by the 
depolarizing map of Eq. (121]) . 

2.5. Positivity, complete positivity and the Choi-Jamiolkowski representation 

In accordance with the fact that the channel <3> must transform quantum states into 
quantum states, Eq. (Tl9|) ensures preservation of positivity (indeed p > implies 
VkpVfr > 0), while Eq. (fl8l) guarantees that the mapping $ preserves the trace (i.e. 
Tr$[p] = Trp). Thus, for arbitrary linear operators p in H, the transformation ([191) 
defines a positive, trace preserving, linear mapping. 

Quite importantly, it satisfies an even stronger property called complete positivity 
which means that any extension $ eg) 16.R of the channel $ by an ideal channel Id# of a 
"parallel" reference system R is again positive. In other words, given a quantum channel 
$ operating on the system A which admits the Kraus decomposition, for any state par 
of the composite system Ha <8> Wr the equation 

^®\d R )[p AR ]=p' AR , (29) 

defines again a proper density operator p' AR (indeed, ($ <g) Id^)^] = Y^kLi^Yk ® 
lR)PAR(Vk <S> IrY > while the trace is obviously preserved). The complete positivity 
together with the trace preservation (CPTP in brief) are necessary and sufficient for a 
linear map $ to have a Kraus decomposition ffl9l) and thus, a unitary representation (fT4j) ; 
hence the CPTP conditions are characteristic for the evolution of an open system and 
can be used to define a quantum channel. 

When the initial state par is nonseparable, the transformation described in Eq. (1291) 
is called "entanglement transmission" . At the physical level it represents those processes 
by which the sender creates locally some entangled configuration of the systems A and R, 
and then sends half of it (specifically the carrier A) to the receiver through a q-q channel. 
The fact that the mapping $ entering in Eq. (1291) is completely positive guarantees that 
the resulting configuration can still be represented as a density operator, and hence 
that the overall transformation admits a quantum mechanical description. Notably not 
all positive maps which take the set of states &{T-La) of a given system A into itself 
are completely positive. The canonical example is the transposition T[p^] = p\ in a 
fixed basis {(ej 1 )}, see the second relation in Eq. ([121) . Since p\ and pa share the same 
spectral properties, it is clear that T maps the states of A into states (hence it is positive). 
However applying the map T <g> Id^ to a maximally entangled state ^7= J2j Wa ® e j?) °f 
the composite system AR one obtains the operator 



d A 



d A 

E l e A®4)(^®4l 

■M=l 



dA . 
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which is not a proper state of AR since it is not positive (its expectation value on 
the vector \e\ ® e R ) — \e A £g> e^) is negative). Such a weird behavior of T should not 
be surprising: as anticipated at the beginning of the section, in physics transposition 
is related to the time inversion. The operation T <g> Id# is thus something like doing 
time inversion only in one part of the composite system AR. To avoid this unphysical 
behavior, one should hence work under the explicit assumption that the q-q mappings 
entering the Eq. (j7]) satisfy to the complete positivity condition. 

In the case of CPTP mappings, the right hand side of Eq. ( 1291) obtained by 
substituting the maximally entangled state associated with the vector ^= £\ |e^<g>e^) 
for par, provides the Choi-Jamiolkowski state of $: 

1 . dA . 

P?i=l-E $ 04)(eiO«)|ei J >(ei| ) (30) 



d A 



which uniquely determines the channel through the inversion formula 

$[p A } = d A Tr R [pW(I A ®pl)] . (31) 

Here p^ = k {e- A \p A \e J A )\e R )(e R \ is the state of R which is constructed via 
transposition of the density matrix of p A . 



2.6. Compositions rules 

Suppose that we have a family {$ Q } of quantum channels where the values of the 
parameter a appear with probabilities p a . Then the mapping defined as 

a 

is still CPTP and hence defines a channel (with a fluctuating parameter a). In 
other words, the set of channels is convex, i.e. closed under convex combinations 
(mixtures). The extremal (pure) channels are those CPTP transformations which admit 
a Kraus decomposition f fl9|) with V*. such that {VfcV fc t}fc,fc' form a collection of linearly 
independent operators |36j (in particular, the unitary evolutions ( TT2l) are extremal). 

The set of channels possesses also a semigroup structure under concatenation. 
Indeed the transformation $20$! obtained by applying $ 2 at the output of the quantum 
channel $1, i.e. 

($ 2 o$ 1 )[p] = $ 2 [$ 1 [p]] , (33) 

is still CPTP and hence a channel. Notice that the product "o" is in general neither 
commutative (i.e. $ 2 ^1 7^ $1 o $ 2 ), nor it admits inversion (the only quantum 
channels which admit CPTP inverse are the reversible transformations defined by 
the unitary mappings ffl2|) ). The possibility of concatenating quantum channels with 
unitary transformations turn out to be useful in defining equivalence classes which, 
in several cases, may help in simplifying the study of the noise effects acting on a 
system. In particular two maps $1 and <3> 2 are said to be unitary equivalent when 
there exists unitary transformations V and U such that $ 2 = &u o Q 1 o <& v (here 
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$[/[•] = £/[•][/' is the quantum channel associated with the unitary U, similarly for 
$y). For instance, exploiting this approach, classifications for the quantum channels 
operating on a qubit |149] or on a single Bosonic mode [HU ED, [32] have been realized 
(see also Sec. 13. 5p . 

Finally, the complete positivity allows us to consider tensor products of channels 
which are most important in information theory, where tensor products describe parallel 
or block channels. Indeed, one can write the tensor product as concatenation of 
completely positive maps 

$1 g> $ 2 = ($i © Id 2 ) o (Idi g> $ 2 ) , (34) 

which is again completely positive. 

2. 1. Complementary channels 

For simplicity till now we considered channels with the same input and output spaces. 
From the point of view of information theory, however, it is quite natural to have pos- 
sibility for them to be different. This is done by considering transformations (|T9l) with 
operators 14 mapping the input space Ha into the output space He while still satis- 
fying the completeness relation (fIBI) . or equivalently, by considering in ( fl~5|) isometric 
transformation V which connects Ha to He ® He- In such cases we shall often use 
self-explanatory notation $ = <&a-+b- 

Example: The quantum erasure channel [68j [13] gives a natural example of a 
channel with different input and output spaces. It represents a communication line 
which transmits the input state p intact with probability 1 — p and "erases" it with 
probability p by replacing it with an erasure signal |e) that is orthogonal to p. Formally 
it can be defined as the CPTP map 

p — > (1 — p) p © p |e)(e| Trp , (35) 

which operates from the input space Ha to the output space He = Ha © obtained 
by "adding" an extra orthogonal vector |e) to Ha- 

A quite important case where the input and output systems differ arises in the 
open system description (jT4"I) of a channel $ that maps a system A into itself. Indeed 
suppose that, instead of the evolution of A, we are interested in the state change of the 
environment E as a function of the input state p of A. This is given by the following 
mapping 

p — ► Tr A U(p®p E )U\ (36) 

where in contrast to Eq. f[T4l) . the partial trace is taken with respect to H = Ha instead 
of He- The transformation f[36l) defines a quantum channel that connects states in Ha to 
states of the channel environment. There is a complementarity relationship between the 
mappings (|T4|) and ([36]) which reflects various aspects of information-disturbance trade- 
off in the evolution of open quantum system. This relation becomes especially strict in 
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the case of pure environmental state pE when it amounts to the notion of complementary 
channels (if p# is a mixed state such complementarity is weakened and the two maps are 
sometimes called weakly complementary [32]). More generally, complementary channels 
can be defined also in a three quantum systems settings. Given the spaces Ha, He, He 
that represent the systems A, B, E, and an isometric operator V : Ha — > He <8> He 
which connects them, we say that the mappings 

$(p) = Tr E VpV^ , $(p) = Tr B VpV* , (37) 

with p being a density operator in Ha, define two channels which are complementary 
PZFJ |90j 1112] (notice that they reduce to Eqs. (tHJ) and (1361) . respectively, in the case 
B = A). Expressing the isometry as in Eq. (fl6|) where again {|e^)}fe is an orthonormal 
basis of He while the V&'s are now operators from Ha to He, the above relations can 
be rewritten as 

d E d E 

m = E v *pvl ' *m = E i e ^i Tr 

k=l k,£=l 

the first one being a Kraus decomposition for $. It can be shown that the second 
relation in Eq. ( 1381) determines all complementary channels of $ uniquely up to uni- 
tary equivalence, moreover, by using a similar construction for $ one can check that 
its complementary is $ = $. Similar relation does not hold for the case of weakly 
complementary channels which are not unique and depend on the selected unitary rep- 
resentation (TH1) of the original map $. 

Example: A complementary channel associated with the identity map Id on H 
(i.e. Id[p] = p for all p), is any map $ that transforms all p into a fixed pure output 
state \iPe) of the environment, i.e. 

$(p) = |^)(^|Trp, (39) 

(to see this put Ha = Hb in Eq. ( 1371) and take the isometry V : Ha —> Ha ® He to be 
the operator V = I <8> \i[>e))- Further, a channel that transforms the density operators 
p into a fixed non-pure output state pe in He is weakly complementary to the identity 
channel Id. (Similarly one finds that complementary to the erasure channel fl35|) is again 
an erasure channel where parameter p has been replaced by 1 — p). 

This example illustrates the fact that perfect transmission of quantum information 
through a quantum channel is equivalent to absence of quantum information transfer 
from the input to the environment E, and vice versa. An approximate version of this 
complementarity expresses the quantum principle of information- disturbance trade- off. 
the closer is the channel $ to an ideal channel, the closer its complement $ to the 
completely depolarizing one |^0] . 

2.8. Quantum channels involving a classical stage 

As shown in Fig. (TJ the process of transmission of classical information through a 
quantum communication line includes encoding and decoding stages which are special 
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cases of quantum channels with classical input, resp. output. Furthermore, a quantum 
information processing system such as a quantum computer, may include certain sub- 
routines where the data have essentially classical nature. Here we characterize such 
mappings by describing them as special instances of CPTP maps. 

(i) The classical-quantum (c-q) channel. In Sec. 12.21 we have seen that this class of 
channels is uniquely defined by a map x — > p x describing encoding of the classical 
input x G X into quantum state p x in the output space He- This process can be 
represented as a quantum channel by introducing the input space Ha spanned by 
an orthonormal basis {|e^)} x ., so that 

x&X 

where p is a density operator in Ha (accordingly output ensembles of the form 
p = J2 x p x Px are generated by taking the input states p such that (e A \p\e A ) — p x ). 
An extreme case of c-q channel is the completely depolarizing channel, describing 
the ultimate result of an irreversible evolution to a final fixed state as in Eq. ( 139]) . 

(ii) The quantum-classical (q-c) channel. Similarly to the previous case, (q-c) 
transformations can be represented as a CPTP map connecting an input quantum 
system B' to an output quantum system A', by introducing for the latter an 
orthonormal basis {\e y A ,)} ye y to represent the classical outcomes y, 

P — ► ®T\a> [P] = E \ C A>) I ^P M V > ( 41 ) 

yey 

where {My] is a POVM in the input space Ha 1 - 

The transmission of classical information through a quantum communication line 
can now be modeled as concatenation of such maps, i.e. <&% C 1^ A , o &b^b' ° &a^Ib with 
<&B^tB' being a generic CPTP map from B to B' . Of particular interest in quantum 
information theory is also the class of maps that arise when the ordering of such a 
concatenation is reversed. In particular, identifying A with A 1 and assuming that the 
bases {le^^}^ and {\eA) y } y coincide, we obtain a q-c-q channel from B' to B of the 
form 

p — ► &£? B [p] = o [p] =J2px TrpM x , (42) 

X 

which describes quantum measurement on B' followed by a state preparation on B 
depending on the outcome of the measurement. Such mappings include the c-q and q-c 
channels as special cases (i.e. c-q channels are obtained by taking M x to be projections 
onto the vectors of orthonormal basis, while q-c channels are obtained when p x are 
orthogonal pure states). 

The channels defined above are characterized by the property of entanglement- 
breaking: operating with them on the half of entangled state Pb'r of a composite system 
B'R produces unentangled state of BR (in particular, their Choi- Jamiolkowski state (l30|) 
is separable). This is not surprising, because in the q-c-q channel the quantum infor- 
mation passes through an intermediate stage (represented by the system A) in which 
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information can only be encoded as a classical state. Less obvious is the fact that any 
entanglement-breaking channel has the form (|42p |101j . Quite non-obviously, the depo- 
larizing channel ( 1211) is entanglement-breaking if the error probability p > d/(d+l) [17 \. 

In all the cases described above the complete positivity can be seen by explicitly 
producing a Kraus decomposition for the map. For instance, let $ be an entanglement- 
breaking channel of the form ( 142)) . Then by making the spectral decompositions of the 
operators p x , M x one arrives to a Kraus decomposition with rank one operators of form 

$[ P ] = j2\v a )(r\p\r)(v a \, (43) 

a 

where \ip a ) are unit vectors while \ip a ) form an overcomplete system for the input 
space, i.e. 

Y J mm = i. (44) 

a 

According to the general formula ( )38|) . the complementary map associated with Eq. (|43|) 
is now obtained as 

$H = ^c^|e a )(rH^)(e /3 |, (45) 

where {|e Q )} is an orthonormal set, and c a p = (ip l3 \ip a ) is a nonnegative semi-definite 
matrix with units on the diagonal. In the case where the input and output spaces 
of $ coincide and \tf) a ) = \e a ), such channels amount to elementwise multiplication 
of the matrices [(e a |p|e^)] and [c a p], called Schur (or Hadamard) product. Such 
channels are called dephasing [UJ (or diagonal |112j ) since they suppress the off-diagonal 
elements of the input density matrix. From ( 1431) we see that the dephasing channels 
are complementary to a particular class of entanglement-breaking channels, namely 
to c-q channels. For another special subclass of the q-c channels, {|<^ a )} of Eq. (|43|) 
constitute an orthonormal base, so that c a/ 3 = 5 a p and the complementary channel is 
a q-c channel which is called completely dephasing since it amounts to nullifying the 
off-diagonal elements of the density matrix [(e^lple^)]. 

3. Bosonic Gaussian channels 

Until now we have considered quantum channels from rather abstract point of view 
focusing on models in which the quantum carriers are effectively described as finite 
dimensional systems (say, collections of qubits). In many real experimental scenarios 
such a description is valid only approximately. To begin with, the fundamental physical 
information carrier is the electromagnetic field which is known to be mathematically 
equivalent to an ensemble of oscillators that in Quantum Mechanics are described 
as infinite dimensional systems [6U I114[ I127[ |194j. This is a typical example of 
"continuous variables" Bosonic quantum system [271 135] whose basic observables 
(oscillator amplitudes) satisfy the Canonical Commutation Relations (CCR) (other 
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examples include vibrational modes of solids, atomic ensembles, nuclear spins in a 
quantum dot and Bose-Einstein condensates). Many of the current experimental 
realizations of quantum information processing are carried out in such systems [331 
[271 ESI E21 [63] . In particular, the most impressive examples of quantum communication 
channels have been realized in this way by using optical fibers [1 761 11291 11751 1177] or 
free space communication |152[ 1178] 1140] . 

In the context of quantum optical communication [33] particularly important are 
the Gaussian states, including coherent and squeezed states realized in lasers and 
nonlinear quantum optical devices, and the corresponding class of quantum information 
processors - the Gaussian Channels [96], HHJ [29] 11821 1183] . These last provide the 
proper mathematical description for the most common sources of noise encountered 
in optical implementations, including attenuation, amplification and thermalization 
processes [96] [33]. This Section is devoted to review of the basics properties of this 
special class of channels. 



3.1. Example: channel with additive Gaussian quantum noise 

As a starter consider the case of a single electromagnetic mode which describes the 
propagation of photons of a given frequency u and fixed polarization through an optical 
fiber. The physics of the system is fully determined by the annihilation and creation 
operators 

' 'uQ + iP) , a f = -= (uQ - iP) , (46) 



'2u y2uj 
where Q, P are the canonical operators (quadratures) which effectively represent the 
field and satisfy the Heisenberg CCR 

[Q,P)=U, (47) 

(in the Schrodinger representation Q = x and P = i _1 d/dx, so that the physical 
momentum operator is p = HP). Suppose that initially the mode is in the thermal 
(Gibbs) state represented by the density matrix 

_ exp [—pH\ 
lr exp |— pH \ 

where H — ■= (u 2 Q 2 + P 2 ) = u(a)a + |) is the Hamiltonian of the system while 
(3 > is the inverse temperature of the state po related to its average photon number 
iV = Trp a^a by the identity j3 = u~ l In For fi G C (the set of all complex numbers) 
define the displaced version of po as the state 

p, = D^poD^fi) , (49) 

which can be obtained, with good approximation, by pumping the mode with a laser 
far above threshold [6H 1194] . In the above expressions fi is the complex amplitude of 
the mode, while D(/j) = exp yo) n — a/2) is the displacement operator which effectively 
describes the action of the laser. 
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One can consider the mapping p — > p M as a c-q channel (14"U|) with continuous 
alphabet C, which encodes the signal p into the quantum state p p . From a mathematical 
point of view it is the simplest channel describing transmission of a classical signal 
on the background of additive quantum Gaussian noise. This model can be naturally 
generalized to describe many-mode (broadband) c-q channels [33, 86J also in the presence 
of squeezing [95] . 

3.2. Multimode Bosonic systems 

A Bosonic system with s degrees of freedom (modes) can be described in terms of 
canonical observables Q%, P\ t • • • , Q s , P s which satisfy the Heisenberg CCR 

[Qj, P k ) = i5 jk , [Q v Q k ] = [P v P k ] = . (50) 

In quantum optics [6411194] they coincide with the system quadratures, and are related to 
the Bosonic creation- annihilation operators via the identities at = (Qj — iujjPj) / a/2w~, 
a j = (Qj + i^jPj) I a/2Wj, where ujj are the frequencies of the modes. Introducing 
the unitary Weyl operators W(z) = exp[iRz], where R = [Qi,P\ t ■ ■ ■ ,Q S ,P S } and z is 
the column vector of real parameters z = [xi,yi t . . . ,x s ,y s ] T , the relations (!50|) can be 
rewritten in the equivalent Weyl- Segal form 

W{z)W(z') = W{z + z') exp[^A (z, z')} = W(z')W(z) exp[iA (z, z')} , (51) 
where 

s 

i=i 

is the canonical symplectic form given by the skew-symmetric block matrix 



A = diag 



-1 

1 



(53) 

j=l,...,s 

The 2s- dimensional real vector space M? s equipped with f[5^|) is called the symplectic 
space and describes the classical phase space underlying the quantum Bosonic 
system [87J. 

The Weyl operators are closely related to displacement operators by inducing the 
translations W Jf (—Az)RW(—Az) = R + z T . Moreover, any reasonable function of 
the canonical observables {Qj,Pj} is or can be approximated by a complex linear 
combination of the Weyl operators W(z). In particular, a state p of the system is 
uniquely described by its characteristic function <f) p (z) defined by the relations 

d 2s z 

W) 

where in the second expression the integral is performed on the whole space M? s . A 
state p is called Gaussian if <fi p {z) has the typical Gaussian form 

4> p (z) = exp ^im T z — -z 1 az^j , (55) 
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<f> p (z) = TrpW(z) , p= p (z)^(-z) 7 ^ , (54) 



where m G M 2s and a is a real symmetric 2s x 2s matrix. Similarly to the case of 
classical Gaussians, the components of m T are mean values, i.e. the expectations of the 
elements of R = [Qi, -Pi, ... , Q s , P a ], and ct is their (symmetrized) correlation matrix in 
the state p. However in the quantum case a is not only nonnegative definite but satisfies 
the matrix (Robertson's) uncertainty relation 

a > ±iA/2 . (56) 

Examples of Gaussian states include vacuum, coherent, squeezed states and also thermal 
states of oscillator systems, as well as their displacements induced by action of a fixed 
source. For instance the one-mode state (1491) is Gaussian with the characteristic function 



z) = exp 



{im q x + m p y) ~\\ N +\) ( x2 + V 2 ) 



(57) 



where z = [x, y] T , p = -4= (um, + im p ), while Eq. (156]) amounts to N > 0, see e.g. [8 



3.3. Quantum Gaussian channels 

We now analyze those CPTP maps p — > p' = $ [p] which, when operating on Gaussian 
input states, preserve their structure. It turns out that a convenient way to represent 
such mappings is to prescribe the action of their duals $* (defined in Sec. I2.4[) on the 
Weyl operators W(z). One can show that the requirement that the resulting dynamics 
induces a linear transformation of the canonical observables of an open Bosonic system 
interacting with a Gaussian environment imply 

$*[W(z)] = W{Kz) exp ( il T z - -z T (3z) , (58) 



2 

where I is the vector in M? s while K and j3 are real 2s x 2s— matrices which satisfy the 
following uncertainty relation 

0>±i[A- K T AK] /2 . (59) 

This inequality is a necessary and sufficient condition for the complete positivity of 
$ [131 [961 EH]- Thus, a Gaussian channel is completely characterized by the three 
quantities (K,l,{3) , where (3 satisfies ( !59l (note that there is no restriction on /). As 
any completely positive map, a Gaussian channel admits Kraus decomposition, explicitly 
described in |105] in the case of one mode. 

By using Eqs. (127D and (15^1) one can verify that the mean m and the correlation 
matrix a of an input state p are transformed by the channel according to the relations 

m^m = K T m + I , a ->• a = K T aK + (3 , (60) 

which, due to Eq. (|59|) . guarantee that the output counterparts of the Gaussian 
inputs ( pyfl) are again Gaussian states. 

A special subclass of the Gaussian channels defined above is obtained for (3 = 0. 
Under this constraint Eq. (159]) forces the matrix K to be symplectic, i.e. to fulfill the 
condition 

K T AK = A . (61) 
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Equation ( 15 8 p implies that these special channels induce the following linear 
transformation of the canonical observables 

R-># = &[R] = RK + l T , (62) 

whose components still satisfy the CCR (15 Op due to Eq. ( IBTI) (for this reason the 
mapping ( 1621) is called canonical) . These transformations represent a unitary evolution of 
the system which, in the Schrodinger representation, can be expressed as a concatenation 
of two elementary processes in which one first shifts the coordinates of the system, 
and then applies a proper symplectic transformation. The first process amounts 
to the unitary transformation induced by the Weyl operator W(—Al), i.e. $[p] = 
W(—Al)pW^(—Al). The second process can be described by a unitary transformation 
Uk in % (quantization of the symplectic matrix K) such that [87] 

U K W(z)U K = W(Kz) , hence U ] K RU K = RK . (63) 

Any reversible dynamics of Bosonic system which in the Heisenberg picture is linear in 
the canonical observables as in Eq. ( 1621) can be described in these terms. This is the 
case for the unitary transformations exp[iH] induced by Hamiltonians H which are at 
most quadratic polynomials of the canonical observables, such as the free Hamiltonian 
of a set of harmonic oscillators, or the Hamiltonian governing a linear amplifier optical 
process |194j . 

The presence of irreversible, noisy dynamics for Gaussian channels is signaled 
by having (3 ^ in Eq. (158]) . In this case it is possible to produce a unitary 
representation (I14p of $ in terms of a unitary coupling with a multimode Bosonic 
environment E characterized by the canonical observables Re that induces the following 
linear input-output relation 

R^ R = RK + R E K E , R E ^ R' E = RL + R E L E , (64) 

where T — [ £ B £ B ] is a symplectic 2 (s + se) x 2 (s + se) matrix (se is the number of 
environmental modes entering the unitary representation). In this framework the initial 
state pe of the environment E is chosen to be Gaussian [291 SO] with the correlation 
matrix a s satisfying (3 = K^oeKe (the minimum value of se under the assumption 
that E is initialized in a pure state computed in Ref. [30] is found to be equal to the rank 
of the matrices [/3 ± i(A — K T AK)/2~j ). The second relation in Eq. ( |64l . depending 
on the (non-)purity of Pe, describes the (weakly) complementary channel p — > p' E of $ 
which is also a Gaussian channel. 

3.4- General properties of the Gaussian channels 

Before discussing some specific example of Gaussian channels in the next Section we 
shall list their general properties [96] : 

(i) Gaussian states are transformed into Gaussian states. 

(ii) The dual of a Gaussian channel transforms a polynomial in the canonical variables 
into polynomial of the same degree. 
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(iii) The concatenation of Gaussian channels is again a Gaussian channel. In fact, let 
$jj j = 1, 2, be two Gaussian channels characterized by the parameters (Kj, lj, /J,), 
then by using ( |58i) . the composite map $ 2 o $ x is Gaussian with parameters 

K = K ± K 2 , 

I = Kjh + l 2 , (65) 
P = K~l$ x K 2 + . 

(iv) Gaussian channels are covariant under the action of Weyl operators. That is for all 
p one has 

<&[W\z) p W(z)\ = W\K'z) <%] W(K'z) , (66) 

where K' = A -1 .?r A. Due to this property, the displacement parameter I 
entering ( 1581) can always be removed as it can be compensated by a proper unitary 
transformation at the input or at the output of the communication line. 

(v) If K is invertible, then $[J] = | detf^| -1 /, in particular, $ is unital if and only if 
| det K\ — 1 [9l]. If $ is Gaussian channel with parameters (K, I, /3), then | det K\§* 
is Gaussian channel with parameters (K~ x , — (i^ -1 ) T Z, (K~ l ) J f3K^ 1 ) (for channels 
in one mode this duality was observed in |105| ). 

(vi) Let $ be quantum Gaussian channel with parameters (K, I, 0). It is entanglement- 
breaking if and only if j3 admits the decomposition [93] 

P = a + u, where a > , v > ^K T AK . (67) 

In this case $ admits a representation of the form 

$[p] = J W(z)a B W\z)p p (z)d 2s z , (68) 

where p p {z) is the probability density in the state p of a Gaussian measurement 
with outcomes z, followed by the displacement W(z) of a Gaussian state ob with 
covariance matrix a. Gaussian measurements are those POVMs which transform 
quantum Gaussian states into Gaussian probability distributions on M 2s , see e.g. [87] 
for detail. In quantum optics these are implemented by optical homo- and/or 
hetero-dynes combined with linear multiport interferometers [194J. 



3.5. The case of one mode 

For "continuous variables" systems the one-mode channels (s = 1) play a role similar to 
qubit channels for finite systems. Therefore it is interesting and important to have their 
classification under unitary equivalence with respect to canonical transformations f )62p . 
the problem a solution for which is given in [9T, 132] . In what follows we consider the 
most relevant examples of those maps whose efficiency in transferring information will 
be presented in Sec. 14.101 
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(i) Attenuation channels. These are characterized by transformation (j58p with I 
and 



K = k 



N + 



k' 1 



(69) 



where k G (0, 1) and iVo is nonnegative - the latter being the condition which 
enforces the inequality (lollj) . The resulting mapping can be described in terms of a 
coupling ( 1X4"|) , mediated by a beam-splitter of transmissivity k [33 |I194] . between the 
input state of the system and an external Bosonic environmental mode initialized 
in a thermal (Gibbs) state fj48|) with the mean photon number N = N /\l — k 2 \. 

ii) Amplification channels. These are described by the same matrices K and of 
Eq. (|69|) were now however the parameter k assumes values larger than 1. As in the 
previous case the resulting mapping can be described in terms of a linear coupling 
with an environmental state initialized in a thermal state with photon numbers N Q . 
In this case however the coupling is provided by a two-mode squeezing Hamiltonian 
which induces a linear amplification of the impinging field |194j . 

Additive classical noise channels. These maps are characterized by the parameters 



m 



K 



1 





P = N 



(70) 



with Nq ^ 0. In this case it is possible to write a simple integral representation for 
the channel 

1 



nN 

with -D(C) = exp[£at — £a], which is a continuous analog of the Kraus decomposition 
expressing the fact that the channel performs random Gaussian displacements of the 
input state as a result of action of a classical random source, e.g. see Ref. [71 J. These 
maps has been also analyzed in the context of universal cloning machines |124[ 134] . 

Let us now see which of the above channels are entanglement-breaking basing 
on the decomposability criterion (1671) . To do so we rely upon the simple fact that 
(N + 1/2)1 > z'A/2 if and only if iV > 0. Therefore since we have K T AK = k 2 A, it 
follows that in this case the decomposability condition holds if and only if (3 > |(l+fc 2 )A 
which is equivalent to N + |1 — k 2 \/2 > (1 + k 2 )/2 or 

iVo > min (l, k 2 ) . (72) 

This gives the condition for entanglement breaking applicable to the channels of the 
classes (i), (ii) and (iii). 



D{QpD\Q exp(-|C| 2 /iV)d 2 C 



(71) 



4. Entropy, information, channel capacities 

The most profound results making the essence of information theory - coding theorems - 
have asymptotic nature and concern the transmission of messages formed by arbitrarily 
long, ordered sequences of symbols which are emitted by a "source" (say a radio 
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station). In practical applications the index that enumerates the various entries of 
the message represents either the (discrete) time when they appear sequentially in 
the communication process, or the number of "modes" in which a given signal can 
be processed simultaneously in parallel (e.g. the frequency modes in which a radio wave 
can be decomposed, or the cells that form a memory register). In a statistical description 
a question inevitably arises regarding what model for correlations between the different 
symbols of a message should be accepted. The simplest yet basic is the memoryless 
model where both the production of the symbols emitted by the source, and their 
subsequent transformations associated with the propagation through a communication 
line, are statistically identical and independent. The coding theorems of information 
theory are not at all trivial already for this case. At the same time they give a basis for 
considering more complicated and realistic scenarios taking into account memory effects. 
Moreover, the effects of entanglement are demonstrated in the memoryless case in the 
most spectacular way. Therefore in our presentation we concentrate on the memoryless 
configurations, providing, where appropriate, references for the more advanced memory 
models. 

In the following we start reviewing some basics results of the classical information 
theory that allow us to evaluate the information content of a classical message 
(Shannon's first coding theorem) and to introduce the notion of capacity for a 
communication channel (Shannon's second coding theorem). Moving into the quantum 
domain we will face then the fundamental issue on how to generalize these results 
in order to have a proper quantum mechanical measure of information. The task 
is particularly challenging due to the complex nature of quantum information (see 
discussion in Sec. 12.11) . In particular, there is a fundamental conceptual distinction 
between the amount of classical information that it is possible to store in a quantum 
system, and the amount of quantum information that it can accommodate. At the level 
of communication theory, this forces to introduce different alternative generalizations of 
the Shannon capacity, see e.g. [161 1166] . 

4-1. Information transmission over a classical channel 

Consider a classical source of information which emits ra-long sequences w = 
(xi, . . . , x n ) G X n formed by symbols Xj that are randomly extracted from an alphabet 
X according to a distribution P = {p x } (the process being repeated identically and 
independently for each element of the sequence as mentioned previously). In classical 
information theory the amount of information that is contained in one of those sequences 
is measured by the minimal number of binary digits (bits) necessary for its binary 
representation (coding). The first Shannon coding theorem [160J says that this number 
is ~ nH(X), where H(X) is the entropy (pQ) of the distribution P that characterizes 
the source. This is obtained by showing that for n ^> 1 the typical messages w 
which are most likely to be produced by the source all have approximately equal 
probabilities 2~ nH ( x \ so that their number is ~ 2 nH( - x \ Since the number of bits 
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necessary to enumerate such typical messages is ~ nH(X), it follows that, by tolerating 
an asymptotically small error probability, one can use this same number of bits to encode 
all possible messages emitted by source (e.g. associating to all the non typical ones a 
fixed erasure symbol). This result can be extended also to a wide class of stationary 
correlated sources models [37J and provides an operational interpretation of H(X) as 
the information content per symbol of the average message emitted by the source. 

Assume now that the messages produced by the source are transmitted through 
a classical noisy channel which maps the n-long input sequences w = (xi, . . . ,x n ) into 
ra-long output messages w' = (yi, . . . , y n ) G y n whose symbols yj G y appear with 
output probability distribution P' = {p' y } where p' y = J2 x P(y\ x )Px- ^ n this expression 
p{y\x) is the single letter conditional probability distribution which fully characterizes 
the process in the memoryless scenario. Memory channels in which the noise tampering 
with the transmission process acts on the letters of the message in a correlated fashion, 
require instead to assign the complete joint input-output probabilities for the messages, 
see e.g. [53]. 

Due to the stochastic nature of the transformation w — > w', the distinguishability 
of the various input messages is not necessarily preserved. Still one can show that 
by properly selecting the codebook of messages w (encoding), it is possible to convey 
(reliably) a definite amount of information to the output of the transmission line. 
Loosely speaking, the second Shannon coding theorem establishes that the amount of 
information per transmitted symbol that can be recovered from the channel outcomes 
is given by the Shannon capacity 

C Shan = m^xI(X;Y) , (73) 

where the maximum is taken over all possible distributions P of the input X. In this 
expression I(X; Y) is the mutual (or Shannon) information, 

I(X; Y) = H{Y) - H(Y\X) = H{X) + H(Y) - H(X, Y) , (74) 

with H{Y) being the entropy of the output distribution P' = {p' y } which measures 
the total information content of the output of the channel, both useful - due to the 
uncertainty of the signal X, and harmful - due to the noise in the channel. Similarly, 
H(X, Y) — — p x y log 2 p X) y is the joint entropy of the pair of random variables X, Y 
whose joint distribution is p XiV = p x p{y\x). Finally H{Y\X) is the conditional entropy 
(called also information loss) defined as 

h(y\x) = Y,p- H ^ Y \ x = x ) = -Y,p-Y,^y\ x ^°^y\ x ) ( 75 ) 

x x y 

= H(X,Y)-H(X) , (76) 

which reflects the effect of the noise in the communication. 

More precisely, the second Shannon theorem says that by using special (block) 
encoding of the messages at the input and decoding at the output of the channel it 
is possible to transmit ~ 2 nCshan (n — > oo) messages with asymptotically vanishing 
error, and it is impossible to safely transmit more whatever encoding and decoding 
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are used. Thus Cshan is the ultimate asymptotically errorless transmission rate for the 
channel p(y\x). In practice the optimal encoding and decoding doing the miraculous 
job of making the noisy channel partially invertible, i.e. transparent to some selected 
messages, are extremely difficult to approach, but Cshan gives the benchmark describing 
the exponential size of such an ideal codebook. Another insightful interpretation of the 
quantity Cshan is given by the "reverse Shannon theorem" (formulated and proved 
surprisingly rather recently [18] , under the influence of ideas from quantum information 
theory). This theorem implies that the ratio log 2 \X\/Cshan is equal to the number 
of copies of the noisy channel p(y\x), needed to simulate (asymptotically) the ideal 
(noiseless) channel with the maximal capacity log 2 \X\. 

The memoryless nature of the classical channel we are considering here is reflected 
by the additivity property 

C Shan = 71 C Shan , (77) 

where Cg^ is the Shannon capacity of the n-block channel one obtains by treating n 
instances of the source as a new source that emits n-long strings w = (xi, • ■ ■ , x n ) as 
individual super-symbols. Accordingly in the definition of C^ an the maximum in (1751) 
is taken over the all input distributions on the space X n , including the correlated 
ones. 

4-2. Quantifying information in a quantum world 

Let us start with a quantum source of information that produces n-long sequences 
of quantum symbols (pure states) \ipj) 6 H, creating factorized states of the form 
\^) = iV'ji) ® • • • <8> iV'jn) e H-® n (a practical example is a pulsed laser that emits 
series of random optical signals characterized by different intensity values). As in the 
classical case we assume that each symbol composing the sequence \^f) is produced in a 
memoryless way, i.e. it is extracted from a given set of possible states independently with 
certain probability distribution {pj}. We ask what is the minimum number of qubits per 
symbol necessary to express the generic state produced by such a process. Schumacher 
and Josza |106j answered this question by showing that in the limit (n — > oo) this number 
coincides with the von Neumann entropy S(p) of the density operator p = Pj\i/jj) (ipj\ 
which represents the average state emitted by the source. More precisely, they proved 
that with the high (asymptotically unit) probability, the sequences |\&) span a subspace 
of %® n of dimensionality m 2 nS ( p \ Since the density matrix describing the average 
sequence is p® n , this can be rephrased by saying that nS(p) is the logarithmic size of 
a "quantum register" in which a given quantum message p® n can be "packed" optimally 
with negligible loss of information. Thus in quantum information theory the logarithm 
of the dimensionality of the space of state vectors carrying information is the measure of 
information content of the system, and plays a role similar to the logarithm of size of the 
codebook for classical messages. Analogously to what the first Shannon coding theorem 
says for the Shannon entropy, the above result provides an operational characterization 
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for the von Neumann entropy S(p), presenting it as a fundamental measure of the 
amount of quantum information that can be stored in the density matrix p. 

This result admits also another operational interpretation. Viewing the quantum 
source defined above as a special instance of a c-q channel (|4"U|) that encodes the classical 
variable j into the pure quantum states the result of Ref. |106] indirectly quantifies 
the amount of quantum resources (qubits) that are necessary to carry on such an 
encoding. The generalization of this result to arbitrary c-q channels is the subject 
of the next Section. 

4-3. The classical capacity of quantum channel: part I 

In Sec. I2.8I we have seen that the simplest quantum model of a communication line is 
the c-q channel (HOI where some classical data x G X are mapped into a fixed family of 
the output quantum states {p x } in the receiver space 7-L. If the letters of the message 
w = (x\, . . . ,x n ) are transmitted independently of each other (no memory) then at the 
output of the composite channel one has separable state p Xl ®.. -®p Xn in the space 1-L® n . 
Decoding at the output requires a quantum measurement in 1-l® n , the outcome of which 
gives an estimate w' for w. For each specific choice of such measurement we can thus 
define a classical channel which takes the classical random variable w to its output 
counterpart w' (passing through the quantum stage w — > p Xl <8> . . . <8> p Xn ). 

Its ability in transferring classical information can now be evaluated by the 
associated Shannon capacity defined as in Sec. El Define to be the maximum 
of this quantity obtained by optimizing it with respect to all possible measurements on 
l-i® n . It turns out that in general, due to existence of entangled measurements at the 
output, differently from ( 1771) . one may have C^ n > > n In other words, for the c-q 

memoryless channels the transmitted classical information can be strictly superadditive. 
A proper definition of the capacity requires hence a regularization with respect to the 
block coding size, i.e. 



Remarkably, the quantity C x , defined rather implicitly by this equation, admits an 
explicit entropic expression. For a statistical ensemble consisting of the density matrices 
{p x } with the probabilities {p x } define the x-injormation as 



This quantity is nonnegative due to the concavity of the von Neumann entropy (see 
also Sec. 14.12ft . To certain extent the ^-information can be regarded as a quantum 
analog of the Shannon information defined in Eq. ( I74p (see however the discussion 
in Sec. 14. 5p . Both these quantities are the differences between the overall output 
entropy of the channel and a term which can be interpreted as the conditional entropy 
(loss). Furthermore, x provides a fundamental upper bound, first proved in [83J, for the 
Shannon information I(X, Y) between the random variable X having the distribution 



C x = lim C {n) /n . 



(78) 




(79) 
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{p x } and the random variable Y describing the outcome of a measurement at the output 
of the channel aimed to recovered the value x, namely 

I(X,Y)< X ({p x },{p x }) ■ (80) 

The coding theorem established by Holevo [851 El] and by Schumacher- 
Westmoreland [153] shows that there exist block coding strategies which, in the limit 
of large n, saturate the bound imposed by Eq. ( IBOl . Thus the abstractly defined 
capacity flTHl) acquires the following compact "one-letter" expression 

C x = max x({Px},{px}) , (81) 

(the maximization is performed over probabilities {p x } while keeping fixed {px})- This 
relation can be regarded as the "classical-quantum" analog of the second Shannon 
coding theorem for the noisy channel. For the later modifications of its proof 
see [TM [TM [T371 1771 IT26] . 

An apparent but important consequence of the bound flHUl) is the inequality 

C x <log 2 dimH, (82) 

in which the equality is attained if the quantum source operates with orthogonal states 
p x . Thus, the fact that the space "H contains infinitely many state vectors, does not allow 
to increase the classical capacity above the ultimate information resource of the quantum 
system; increasing the number of signal vectors forces them to become nonorthogonal 
and hence less and less distinguishable. This is in line with the observation in the pre- 
vious section where it was shown that the logarithm of dimensionality of the Hilbert 
space of a quantum system determines the ultimate bound on the amount of quantum 
information one can store in it. 



Example 1: Consider the binary input signal x = ±1, and let p±\ be the coherent 
state of a monochromatic laser beam with the complex amplitude ±z. This defines a 
c-q channel with two pure nonorthogonal states |90j whose classical capacity (IHTj) can 
be computed as 

C x = h (i±i) , (83) 

where e = (z\ — z) = exp(— 2|z| 2 ) is the overlap between the two coherent states, while 
^(p) = — plog 2 p— (1 — p) log 2 (l — p) is the Shannon binary entropy. Also the quantity 
can be explicitly computed [119] yielding 

C<» = 1 - h 2 + . (84) 

For this special case one can then easily verify that C x /C^' > 1, the ratio tending to 
oo in the limit of the weak signal, as e — > 1, i.e. z — > 0. 
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Example 2: Consider the c-q channel in which the classical alphabet X is 
composed by three symbols (say x — 0, +, — ), which are mapped into three equiangular 
pure states l^) of a qubit system, 



|V>o> 



1 





-1/2 
iV^/2 



J5) 



In this case of equal probabilities, p x = 1/3, the average density matrix coincides with 
the chaotic state (i.e. p = J2 X I V^} {fix |/3 = -f/2). The classical capacity f lHTj) of the 
channel is hence C x = 5(7/2) = 1 which saturates the bound ( 182]) in spite of the 
fact that the states are not orthogonal. Also for this channel the gap between C x 
and can be shown. One has ~ 0.645, the value attained for the distribution 
p + = p_ = |, = and for the measurement corresponding to the orthonormal basis, 
optimal for discrimination between the two equiprobable states \ij) x ),x = ± |150[ 1164] . 
However the optimal encoding and decoding for this case are unknown, as for the most 
of the Shannon theory. 

More generally, let {|V> x )j x X} be an overcomplete system (I44p in a <i-dimensional 
space (the above example being just a special case with \if> x ) = y/2j3\il> x )). Then the 
overcompleteness relation can be written as ^2 x p x Px — I/d, where p x = (ipx\ipx)/d and 
Px = Wx)wx\/y^»Wx)- This implies that the c-q channel x — > p x has the capacity 
C x = S(I/d) = log 2 d. Furthermore, since the inequality in (IBUl) is strict unless the 
operators p x commute [83J, one can conclude that < C x = log 2 d unless {|^)} is 
an orthonormal basis. 

Example 3: Consider the c-q channel model introduced in Sec. 13.11 where a 
continuous alphabet C is encoded into quantum states via the mapping p -» p^ with 
p M given by f T49|) . If one tries to compute the classical capacity of such a channel by 
a continuous analog of the formula ( 18T1) where the probabilities p x are replaced by 
probability distributions p(p) on the complex plane C, i.e. 

G * = { S (/ p(y ^ p ^ d2p ) ~ / p(y ^ S<yp ^ d2p \ ' ( 86 ) 

one gets infinite value as it is to be expected for a channel with infinite input alphabet. 
To obtain a reasonable finite result one should introduce a constraint onto possible input 
distributions p(p) restricting the "energy" that one can actually add in the fiber, namely 



\p\ p(p)d p < E . (87) 

With this constraint, the maximum (|86l) can be evaluated by taking into account two 
facts: first, the states (1491) are all unitarily equivalent and have the same entropy 

S(pJ = S(po) = g(N) = {N + 1) log 2 (iV + 1) - iVlog 2 N , (88) 

see e.g. (87]; second, the constraint flHTj) implies 

Tr p a) a < N + E , (89) 
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where p = J p(p)p fJL d 2 p is the average quantum state transferred in the communication. 
Consequently, by the maximal entropy principle, the entropy S (p) is maximized for the 
Gaussian distribution p(p) = exp ( — \p\ 2 /E) giving the value g(N + E), whence the 
channel capacity is 

C x = g(N + E)-g(N). (90) 

In the classical limit of large energies (i.e. iV — > oo, E/N — > const) this turns 
into C x = log 2 (l + E/N), which can be regarded as a generalization of the famous 
Shannon's formula |160] Cshan = \ log 2 (1 + E/N), for the capacity of memoryless 
channel with additive Gaussian white noise of power N (the factor 1/2 being absent 
in the quantum case because one degree of freedom amounts to the two independent 
identically distributed real amplitudes). The capacity formula can be generalized to 
many-mode (broadband) c-q Gaussian channels (33], |86] also in the presence of squeezed 
seed states [95J. 

In the Shannon theory one considers memory channels with stationary Gaussian 
noise by making spectral decomposition of the time series in question. It turns out 
that the modes corresponding to different frequencies can be considered asymptotically 
independent if the observation time is very large. Then effectively one has a set of parallel 
independent channels which can be approached as a kind of "memoryless" composite 
channel |37j. Similar reduction to the memoryless channel in the frequency domain can 
be elaborated for the c-q channels with additive stationary quantum Gaussian noise [86J . 

4-4- The classical capacity of quantum channel: part II 

In the previous section we focused on the basic case of quantum channels that can be 
used to transfer classical messages, the c-q channels. They are formally constructed 
by fixing a set of possible quantum letters p x in the space % organized in the n-long 
sequences p Xl <E> • • • <8> p Xn that form the codewords in which the classical information can 
be encoded. 

More generally, assume that the sender of the classical information is allowed to 
use as input any (possibly entangled [15J, [16]) joint density matrix p^ G S("H® n ) of 
n quantum information carriers. Here w denotes a classical message encoded into the 
state p {w) . We assume that there are N different messages to be transmitted, hence w 
can be simply the number of the message, w = 1, . . . , N. Assume also that during the 
transmission stage each component of the codeword individually affected by the 

same noisy channel (memoryless regime), producing the outputs of the form 

p W — y $®«[ p H] ; (91) 

where $® n is the n-fold tensor product defined similarly to (1341) of the quantum channel 
$ which gives the statistical description ffT4l) of the interaction of a single carrier with 
the environment (i.e. noise). Notice that for each given collection of input density 
matrices {p^; w = 1, . . . , N}, Eq. ( 19T1) defines an effective c-q channel whose quantum 
letters are the density matrices $® n [pW] . The associated classical capacity of such c-q 
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channel is then computed along the lines of Sec. 14. 3[ 

C x (^ n ) = max v ({p w } , {*•><«>]}) , (92) 
{pw},{p {w) } 

where now, given the freedom the sender has in selecting the input density matrices 
of the channel, the maximum is performed over the set of statistical ensembles formed 
by the probabilities {p w } and the states {p^ w '}- Loosely speaking, Eq. ( 1921) gives the 
ultimate rate of classical bits that one can transmit when using quantum block-letters of 
size n, hence corresponding to the rate C x (<&® n )/n bits per individual use of the channel 
$. Therefore the ultimate (asymptotically achievable) rate is given by the expression 

C($) = lim -CJ^ n ) . (93) 

n—>oo n 



It should be stressed that the regularizations involved in Eqs. (1931) and ( 1781) have different 
origin. While taking the limit in Eq. (I78|) is required by superadditivity due to possible 
use of entanglement at the decoding stage of the communication process (see previous 
Section), the limit in Eq. (I9"3"j) is required by possible superadditivity of the function 
C x ($ (gln ) as a consequence of using entangled quantum block-letter p-™' at the encoding 
stage. 

In the case where the property of additivity holds, i.e. 

C x (**») = n C x (*) , (94) 

the regularization ( 1931) is not needed and the capacity admits a simple single letter 
expression C($) = C x ($), where 



Px,Px 



\ X / X ) 



C x ($) = max IS > p x $ [ Px ] \-yPxS ($ [p x ]) } , (95) 



and the maximization is over all finite ensembles of states p x taken with probabilities 
p x . The additivity ( I94p means that using entangled states at the input of the channel 
$ does not increase the quantity of transmitted classical information. The validity of 
the property ( 194"]) was established for a number of channels, including the unital qubit 
channels |109] , the depolarizing channel [HOj , the erasure channel [13] , the purely lossy 
Bosonic channel [58], and the whole class of entanglement-breaking channels [163] . In 
all of these cases except the last one the analytical solution of the maximization problem 
for C x ($) is possible. 

Example: For the depolarizing channel of Eq. (I2ip . the additivity (194")) combined 
with the high symmetry of the map $ allows one to find its classical capacity |110j . 

C(*) = C x (<&) = log 2 d+(l- p^-) log 2 (l - p^i) + p d -^- log 2 V - , (96) 
with the maximum attained on the ensemble of d equiprobable orthogonal pure states. 



The need of the regularization limit in Eq. ( 1931) for a generic channel has 
been debated at length, for a survey see e.g. [92], I147j . The question, if there 
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exist at all nonadditive quantum channels, turned out to be extremely difficult and 
remained open for rather a long time. The additivity of C x plays an important role 
in quantum information and is linked to the additivity of other quantum entropic 
quantities |130[ 1167] including the minimal output entropy mm p S($[p\). A significant 
step forward was made by Shor |167j . who showed in particular that proving the 
additivity of the minimal output entropy for all pairs of channels, i.e. 

minS($i ® $ 2 [p]) = minS($i[p]) + minS($ 2 [p]) , V$i,$ 2 , (97) 
p p p 

is equivalent to proving the similar additivity for C x , i.e. 

C x ($! ® $ 2 ) = C7 x ($i) + C x ($ 2 ) , V$i, $ 2 , (98) 

and hence it would imply ( 194|) . This in turn stimulated intensive research of several re- 
lated quantities, which besides the von Neumann entropy |113j . include quantum Renyi 
entropies see e.g. Refs. 11081 111 1 1 11591 loll |5j |46]. The possibility of violating the ad- 
ditivity for these functionals was established in several cases [1871 11891 1781 1791 138] , but 
the violations were not strong enough to imply superadditivity of C x . The problem has 
been settled by Hastings [71] who proved that channels which violate the additivity ( 1971) 
and hence require the regularization in Eq. (193]) do exist at least in very high dimensions 
(see [52J for the actual dimensionality estimates), among mixtures of unitary channels of 
the form <£>[p] = J2j n jUjpUj (here TTj is a probability distribution while Uj are unitary 
operators), but so far no concrete example was found. 

It is finally worth mentioning that transmission of classical information through 
quantum channels may display yet another form of superadditivity of a higher 
complexity level as compared to the one implied by the regularizations of Eqs. flTSj) 
and ( |93l) . Namely, consider two different quantum channels $i and $ 2 with the classical 
capacities C($i) and C($ 2 ) defined as in ( 1931) . The problem is whether the capacity 
C($i <8> $2) °f the tensor product channel $1 <g) $ 2 can be strictly greater than the sum 
of the two individual capacities, i.e. 

C($i ® $ 2 ) > C($i) + C($ 2 ) . (99) 

It should be stressed that due to the regularization present in ( 193]) the superadditivity 
of the functional C x (-) proved in Ref. [71] is not sufficient to answer this question. In 
fact, the strict inequality in ( 1991) would be similar to the superactivation property for the 
quantum capacities |171j . see Sec. 14.111 Proving C($i £g> $ 2 ) > C($i) + C($ 2 ) is trivial: 
this follows simply from the possibility of operating the two channels independently, 
i.e. using codewords that factorize in the partition $! <g> $ 2 - However the possibility 
to have the strict inequality here stems from the freedom of introducing the quantum 
correlations at the input of such a partition. 

4-5. Entropy exchange and quantum mutual information 

In the classical case correlation between two random variables X, Y (which, in particular, 
may describe input and output of a classical channel) can be measured by the mutual 
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purified 




Figure 3. Schematics of the purification of the information transfer through the 
quantum channel $ connecting the input system A to the output space B. Here R is a 
reference system which is used to purify the input states pa of A. Following the analysis 
of Sec. [2] the mapping $ is then represented in terms of an isometric transformation V 
that maps vectors of A into (possibly) entangled states of B and of the environment 
E. The reduced density matrix of B and E provide, respectively, the output states of 
the channel $ and of its complementary In the quantum wiretap model of Sec. 14.91 
the reduced density matrix of E is assigned to the eavesdropper. 

information /(X; Y) introduced in Eq. ( 1741) . A quantum analog of this quantity has been 
identified in Sec. 14.31 as the ^-information of Eq. (179|) . The quantity x however does 
not account for all the correlations which can be established between the input and the 
output of a quantum channel. To characterize them one needs to look for other quantum 
generalizations of I(X; Y) (the possibility of multiple quantum generalizations should 
not be surprising: ultimately this is a consequence of the fact that joint distribution of 
quantum observables exists only in the very special case when they commute). 

One way to proceed is to exploit the following purification trick [1531 1154] . Consider 
a quantum channel $ = <$>a-^b that maps the input states p = pa G &(T-La) into the 
outputs pb = Let us introduce a reference system Hr ~ %a and purify the 

input p to \4>ar)(4>ar\ in the space Ha ® Hr. By Eq. (JHJ) the state pn of the reference 
system has the same spectrum as p and hence S(p) = S(pr). In what follows, for the 
sake of simplicity, we shall abbreviate the notations for the entropies of partial states 
by omitting the symbol of the density operator p, so, for example, the last equality 
will be written as S(A) = S(R). Now let us focus on the entanglement transmission 
scenario (|29|) where the state \ip ar) ar\ is transmitted via the channel §a^b ® Id^, 
producing the output state 

Pbr = (®a^b <8> Id*) War) {^ar\} ■ (100) 
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Consider the quantity 

I{p,$) = S(R) + S(B)-S(BR) , (101) 

which is nonnegative and vanishes if and only if pbr = Pb ® Pr, see Sec. 14.121 Since 
the reference system R is a copy of the input A which remains intact in the course 
of the transmission, one can interpret I(p, $) as another substitute for the Shannon 
mutual information between the input and the output of the quantum channel. It is 
called the quantum mutual information [1231 |2]. It is combined of the three entropies: 
S(R) = S(p) - the input entropy, S(B) = S(Q>[p]) - the output entropy, and the joint 
entropy S(BR) which deserves a closer look in the isometric representation ( 13?]) of the 
channel 

A useful expression for the quantity f llOip is obtained by introducing the channel 
environment E and the isometry V : Ha — ► %b®He which represent the transformation 
$ as detailed in Sec. 12.31 see Fig. |3j At the end of the transmission we have the tripartite 
system BER characterized by the space Hb ® We ® Hr- The total output state Pber 
in %b <S> He <8> Hr is pure and is described by the vector \iPber) = (V (8) Ir)\4>ar)- 
Looking at the split BR\E we have a pure state bipartite system with partial states 
Pbr = T^ePber, Pe = T^brPber whose entropies S(BR) and S(E) are equal, see 
Eq. (jSJ). This implies 

S(BR) = S(E) = S(p, $) , (102) 

where the last quantity is called the entropy exchange |123l 11531 1154] . as it measures 
the entropy change in the environment (recall that in the isometric representation the 
initial environment state is assumed pure). Notice also that by construction pE is the 
state at the output of the complementary channel Therefore by Eq. (EE}, 

S(p,Q) = S(9\p]) = s(\TrpV}V a ] ) . (103) 

\L J a,P=l,...,d E / 

The quantum mutual information can then be expressed as 

I(p,$) = S(p) + S($[p])-S(p,$). (104) 



Example: Consider the depolarizing channel ( 1211) and the chaotic state I/d. By 
using the Kraus decomposition ( 122|) we find that the state $[I/d\ has eigenvalues and 
1 ~P^7j^~ °f rnultiplicity 1, and Jj of multiplicity d 2 — 1 (the zero appearing because of 
the linear dependence of the Kraus operators). Hence, 

/ J 2 - 1 \ / d 2 - 1 \ d 2 - 1 n 
S(I/d, $) = - (l - p-^-) log 2 (l - P —) - p— log 2 | . (105) 

Combining with the input and the output entropies S(I/d) = S(Q[I/d}) = log 2 d, we 
obtain 

I (I/d, $) = log 2 d 2 + (l - P^~^) log 2 (l - P^p^) + P^^- log 2 j 2 ■ (106) 
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Up to now we have encountered three entropic quantities: the input entropy S(p), 
the output entropy £ ($[/>]) and the entropy exchange S(p,$). By making different 
bipartite splits of the system BER, we obtain similarly to (I102p the identities 

S{BE) = S{R) = S{p) , (107) 
S(RE) = S(B) ee S($\p]) , (108) 

and another two information quantities: the loss, i.e. the quantum mutual information 
between the input and the environment: 

L(p, $) ee S(R) + S(E) - S(RE) = S(p) + S(p, $) - S($[p}) , (109) 

and the noise, i.e. the quantum mutual information between the output and the 
environment 

N(p, $) ee S(E) + S(B) - S(EB) = S(p, $) + - . (110) 

The nonnegativity of the quantities I(p, $), L(p, $) and iV(p, $) along with fll07p . 
(11081) implies that the basic entropies S(p), S(<&[p)), and S(p, $) satisfy all the triangle 
inequalities, namely, 

|S($[p])-S(p,$)|<S(p), (111) 

and two other inequalities obtained by cyclic permutations. 

As in the case of the Shannon mutual information (!74j) . the quantities I(p, 
L(p, $) and N(p, $) can also be expressed in terms of conditional (quantum) entropies. 
For instance we have 

J(p,$) = 3{B) -S(B\R) , S(B\R) = S(BR)-S(R) = S(p,$)-S(p), (112) 

with S(B\R) being the quantum conditional entropy of B given R. Notice however that 
while in the classical case the conditional entropy is always non-negative, in the quantum 
scenario this is no longer valid as the quantum entropy is not necessarily monotone with 
respect to enlargement of the system, i.e. S(R) S(BR) (an extreme example of this 
property is obtained when BR is a pure entangled system: in this case S(BR) = while 
S(R) > - this cannot happen in the classical statistics where the partial state of a 
pure state can never be mixed, see also Sec. 12.11) . An operational interpretation of the 
possible negativity of the conditional entropy in terms of "quantum state merging" is 
given in Ref. [100] . Quite remarkably, the monotonicity of the conditional entropy still 
holds: for any composite system ABC one has 

S{A\BC) < S{A\B) , (113) 

and it is this property which makes the quantum conditional entropy useful. When 
written in terms of (unconditional) quantum entropy, the inequality (11131) amounts to 
the fundamental property of strong subadditivity: 

S{ABC) + S(B) < S(AB) + S(BC) . (114) 
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The proof of this very useful inequality was given first by Lieb and Ruskai in Ref. [121] 
and remains rather involved in the quantum case even after subsequent simplifications, 
see e.g. |148j . 

Due to the strong subadditivity (11141) . the quantum mutual information enjoys 
important properties similar to that of the Shannon information in the classical 
case [21 1135j . Specifically, given two quantum channels $i and <£>2, one has 

(i) subadditivity: I(p 12 , $i <g> $ 2 ) < I{pi, $i) + Hp2, $2) ; (115) 

(ii) data processing inequalities: I(p, $ 2 $1) < min{/(p, $x), J($ 1 (p), $2)} • (116) 

4.6. Entanglement as an information resource 

In the previous sections we have seen that entangled decodings can enhance transmission 
of classical information through quantum channel. An even more impressive gain 
is achieved when entanglement between the input and the output of the channel is 
available. The classical capacity of a memoryless communication line defined by a 
quantum channel <3> can be substantially increased by using this additional resource in 
spite of the fact that entanglement alone cannot be used to transmit information. This 
fundamental observation was first made by Bennett and Wiesner who introduced the 
notion of superdense coding [19] . see also [TJ [261 H46] for generalizations to continuous 
variables and |131[ I141[ 11201 H33J for experimental tests. Here as in some other cases 
entanglement plays a role of "catalyser", disclosing latent information resources of a 
quantum system. 

The scenario of entanglement- assisted communication assumes that prior to the 
information transmission parts of an entangled state pab are distributed between sender 
A and receiver B. Then sender A encodes the classical messages w into different 
operations £ w on his part of the entangled state, and the result of these operations 
is sent to B via the quantum channel $. Thus at the end of the transmission the state 
($ o £ w <S> Ids) [pab] is available to the receiver which extracts the classical information 
by making quantum measurement on this state. Optimizing the transmission rate 
with respect to the entangled states pab, (block) encodings of A and measurements 
(decodings) of B gives the classical entanglement- assisted capacity. The corresponding 
coding theorem of Bennett, Shor, Smolin and Thapliyal provides a simple formula 
(the proof of which is far from being simple, [HI [171 EE]) giving an operational 
characterization for the quantum mutual information (11041) : 

C ea ($)= max J(p,$), (117) 
p 

where the maximum is taken over all possible input states p of a single channel use. 
(Equation (1 11 7ft refers to the case of unlimited shared entanglement. For analysis of the 
case in which only limited resources are available see Refs. [1671 [2T[ B5]). Remarkably, 
unlike Eq. (1931) . the capacity formula (11171) does not contain the limit n — > 00 because 
C e a(&® n ) = n C ea ($) due to subadditivity (11151) of the quantum mutual information. 
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Also by construction it follows that entanglement-assisted capacity C ea ($) is always 
greater than or equal to its unassisted counterpart C($). 

If $ = Id is the noiseless channel then this scenario coincides with the original 
superdense coding protocol [19] for which C ea (Id) = 21og 2 <i = 2C(Id), so the capacity 
gain is equal to 2. Similar doubling of the capacity holds for the quantum erasure 
channel. However in general, the more noisy is the channel - the greater is the gain, 
and in the limit of very noisy channels the gain can be arbitrarily large. For example, in 
the case of depolarizing channel ( )2T|) . the maximum in ( 1117P is attained on the chaotic 
state so that C ea ($) is given by the formula (j!06p . Comparing this with the quantity 
C(<3>) given by the formula ( 1961) . one sees that the gain — > d + 1 in the limit of 
large noise p — > 1. 

Interestingly, one can have C ea ($) > C(<&) for entanglement-breaking channel, for 
example, this holds for the depolarizing channel with p > d/(d + 1). The explanation 
given in [T7] is that unassisted classical data transmission through entanglement- 
breaking channel involves communication cost which is always greater than the difference 
between C ea (<3>) and C($). 

A protocol in a sense dual to superdense coding is quantum teleportation which 
was first introduced in Ref. [UJ. Quantum information theory predicts the possibility 
of a nontrivial way of transmitting arbitrary quantum state p, when the state carrier 
is not transferred physically but only some classical information is transmitted trough 
a classical communication line, see also Refs. [25] I132[ 1179] for the generalization to 
continuous variables and Refs. [TOj [20] for the first experimental test. The necessary 
additional resource here is a shared entangled state between the sender and the receiver 
of the classical information (it is impossible to reduce the transmission of an arbitrary 
quantum state solely to sending the classical information: since the classical information 
can be copied, it would mean the possibility of cloning the quantum information |193j ). 
The effective maps $ resulting from such protocols are called quantum teleportation 
channels [99j I143[ 123] 1192] . They represent a proper subset of the class of quantum 
communication systems which can be fully specified by assigning a joint initial state 
Pabi characterizing the shared entanglement between the sender and the receiver, and 
the local operations the two parties are supposed to perform on it. 

4-7. Coherent information and perfect error correction 

An important part of the quantum mutual information J(p, $) is the coherent 
information |153] 

I e {p,9) = S{*\p]) - S(p,*) 
= S(B) - S(E) 
= S(B) - S(BR) 

= -S(R\B) . (118) 
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This quantity is closely related to the quantum capacity of the channel $ which will be 
considered in the next Section. 

The coherent information does not share some "natural" properties of quantum 
mutual information such as subadditivity and the second data processing inequality. 
Moreover similarly to its classical analog H(Y) — H(XY) = — H(X\Y) it can be 
negative. However I c (p, $) satisfies the first data processing inequality: 

4(p,$ 2 o$i)</ c (p,$i) , (119) 

which follows from the relation I c {p, 3>) = I{p, 3>) — S(p) and from the corresponding 
property of quantum mutual information I(p, $). 

There is a close connection between perfect transmission of quantum information, 
error correction and certain property of coherent information. The channel $ is perfectly 
reversible on the state p = pa, if there exists a recovery channel V from B to A, such 
that 

{Vo^®l<\ R )[ PAR } =p AR , (120) 

where par is a purification of the state p A with the reference system R. 
One can show [9] that the following statements are equivalent: 

(i) the channel $ is perfectly reversible on the state p; 

(ii) L(p, $) = 0, that is p RE = Pr® Pe] 

(iii) I e (p,*) = S(p). 

The condition (ii) means that information does not leak into the environment, i.e. 
the channel is "secret" or "private". Thus the perfect reversibility of the channel is 
equivalent to its privacy. The condition (iii) means that under private transmission 
of the state p by the channel <3>, the coherent information I c (p, $) should attain its 
maximal value S(p). The equivalence of (ii) and (iii) is obvious since S(p) — I c (p, $) = 
S(p) + S(p; $) - S($\p]) = L(p, $) > by Eq. ([109]). 

In particular, choosing the chaotic state p = I A /d A , we get the condition log 2 d A = 
I c (I A /d A ,Q). This shows that the coherent information should be related to the 
quantum capacity of the channel $, which characterizes the maximal dimension of 
the perfectly transmittable states. (In fact, such a relation is valid for asymptotically 
perfect transmission through the block channel $ (g,n when n — > oo, as we shall see in 
the following Section). In support to this argument we notice here that the perfect 
reversibility property of $ on p can also be stated by saying that there exists a recovery 
channel T> such that 

Vo$[p'] = p' , (121) 

for all states p' with suppfp'] C C = suppfp], with supp[p] being the support of the 
state p. We can express the same property by saying that $ is perfectly reversible on 
the subspace C. In other words, the subspace £ is a quantum code correcting the error 
described by the noisy channel $ (and hence, all the related errors p — > VjpVj , where 
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On the other hand, notice that by the formula (I102p S(p; $) = where $ is 

the complementary channel, hence 

Up, $) = s($\p}) - S($[p]) = -i c ( P , <£). (122) 

The next statement which generalizes observation at the end of Sec. 12.71 gives a 
characterization in terms of complementary channel which underlies the coding theorem 
for the secret classical capacity of the channel in Sec. 14.91 The following conditions are 
equivalent: 

(i) the channel <3> is perfectly reversible on the subspace £; 

(ii) the complementary channel ( 138]) is completely depolarizing on C, i.e. 

%'] = Pe , (123) 
for any state p' with supp[p'] C £, where pE is a fixed state. 

4-8. The quantum capacity 

The transformation p — > $[p] of quantum states can be regarded as the transfer of 
quantum information. The discovery of the quantum error- correcting codes [162] 1172] 
is related to the question of asymptotically (as n — > oo) error-free transmission of 
quantum information by the channel The quantum capacity Q(<&) is defined as 

the maximum amount of quantum information per one use of the channel which can be 
transmitted with asymptotically vanishing error [161 H25] [HI l4~4"l!118j . It is related to the 
dimensionality of the subspace of state vectors in the input space (~ 2 nQ ^) that are 
transmitted asymptotically error- free. For the quantum capacity there is an expression 
in terms of coherent information (11181) . namely 

Q($) = lim -max/ c (p( n \$® n ) , (124) 

the maximum being performed over all input states p^ n > of n successive channel uses. 
The relation between Q(<&) and the coherent information of the channel was conjectured 
in Ref. [125J and made more precise in Refs. [9~] I165] . while the ultimate proof of Eq. (I124p 
was given by Devetak [44J exploiting the fact that the quantum capacity of a channel 
is closely related to its cryptographic characteristics, such as the capacity for the secret 
transmission of classical information and the rate of the random key distribution. More 
specifically, as discussed in detail in the following Section, a deep analogy with a wiretap 
channel |190] was used, the role of the eavesdropper in the quantum case played by the 
environment of the system. 

For the ideal channel Id one easily verifies that Q(ld) = log 2 <i. Analytical 
expression for the capacity of quantum depolarizing channel (12T|) is still unknown in 
the general case, although there are fairly close lower [HI SEJ [72] and upper [1801 11441 
I145[ 1170] bounds for it. A major difficulty of evaluating Q(&) lies in the nonadditivity 
of the quantity I c (p( n \ $® n ) [38]. However, there is an important class of degradable 
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channels |?7J EH] for which the additivity holds, so one can replace Eq. (I124p with the 
convenient "one-letter" formula 



g($) = g 1 ($)=max/ c (p,$) , 



(125) 



p 



where now the maximization is taken over the input states p for single channel use. The 
channel $ is called degradable if there exists quantum channel T such that $ = To$ [47] . 
In practical terms, this relation expresses the fact that the complementary channel $ is 
"more noisy" than $ (it can be obtained from the latter by "adding" the extra noise T). 
Notable examples of degradable channels are the dephasing channels [171 1112] and the 
amplitude damping channel [56] for which the maximization in dl25[) can be explic- 
itly performed. Another class of interest is constituted by the so called anti- degradable 
channels: $ is called anti-degradable [32| 156] . if there exist a channel T', such that 
$ = T' o $. Apparently $ is degradable if and only if $ is anti-degradable. Any anti- 
degradable channel <£> has the null quantum capacity, Q($) = 0. A formal proof of this 
fact will be given in Sec. I4.12( there is however a simple heuristic argument based on no- 
cloning theorem, see Eq. ( Fl3|) . that explains why Q($) cannot be positive. Suppose that 
<5($) > for anti-degradable channel $. This implies that, via encoding and decodings 
protocols, the sender will be able to transfer to the receiver at the output of the channel 
$ an arbitrary (unknown) pure quantum state Since $ is anti-degradable, this 
implies that the same protocol will also allow to recover the same quantum message at 
the output of the complementary channel $ (one can reconstruct the associated output 
of $ by applying the channel T'). Now the contradiction arises by observing that in 
the isometric representation introduced in Sec. 14.51 $ and $ describe the two reduced 
quantum information flows that enter, respectively, to the receiver and to the channel 
environment. Two independent observers collecting those data will thus be able to get a 
copy of the same state realizing de facto a cloning machine, which is impossible. No- 
table examples of anti-degradable channels are provided by the entanglement-breaking 
channel (T431) whose complementary counterparts are the dephasing channel ( ]4"5i) (the 
channel T' in this case can be taken as T'[p] = J2 a \(p a )(e a \p\e a )((p a \). 

Example: The quantum erasure channel $ p introduced in Eq. ( 1351) is degradable 
for p G [0,1/2] and anti-degradable for p G [1/2,1]. Its quantum capacity is 
computed p2] as 



Another class of channels which, as the anti-degradable ones, posses null quantum 
capacity is given by the so called PPT (partial positive transpose) or binding 
channels |102j . These maps include as a special case the entanglement-breaking 
channels of Sec. I2.8[ and are characterized by the property that their associated 
Choi-Jamiolkowski state is PPT, i.e. it remains positive under partial transposition 




V e [0, 1/2] ; 

pe [1/2,1] . 



(126) 
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in the reference space. States that possess this property might not be separable 
but their entanglement (called bound entanglement) is "weak" as it doesn't allow for 
distillation [1031 1104] . Binding maps (if not entanglement-breaking) allow for a certain 
amount of entanglement transfer: the latter however is always non-distillable and, 
even though it admits private communication between the sender and the receiver (see 
Sec. 14. 9p . it cannot be used for the faithful transfer of quantum information (not even 
in the presence of a two-way classical communication side line) [102J. For a recent 
study of the connections between anti-degradable and PPT channels, as well as on the 
characterization of the set of maps that have null quantum capacity see Ref. |169j . 



4-9. Quantum wiretap channel 

Here we review in brief the argument behind the derivation of Eq. (j!24p . Consider 
the situation of classical information transmission in which there are three parties: 
sender A, receiver B and the eavesdropper E. A mathematical model of the quantum 
wiretap channel comprises three Hilbert spaces 'Ha,'Hb,'He and the isometric map 
V : Ha — > Wb ® We, which transforms the input state pa into the state pbe = 
$a^be [pa] = V paV^ of the system BE, with partial states 

Pb = $ [Pa] = Tr E Vp A V*, p E = I [p A ] = TtbV Pa V ] . (127) 

Notice that this description is formally identical to that of the complementary channels 
in Sec. 12 .7\ where E denoted the environment (in the wiretap model this is supposed 
to be completely under control of the eavesdropper). 

Assume now that A sends the states {p A } with probabilities {p x }; then the parties 
B and E receive, correspondingly, the states {p%} and {p%} , and upper bounds for 
Shannon informations they receive are the quantities \ ({Px}, {p°b}) anc ^ X ({Px}, {p E }), 
as stated in Eq. ( IHUl) . In analogy with the classical wiretap channel |190] the "secrecy" 
of the transmission can be characterized by the quantity x ({Px}, {Pb}) ~ X ({Px}, {p%}) 
(here by secrecy we mean the amount of information which can be shared between the 
A and B without informing E). In fact, the capacity for secret transmission of classical 
information is shown to be [H] 

C p ($a^be) = lim - max \ X ({p^}, VW) - X ({p^}, > ( 12 8) 

where the maximum is taken over the families of states = {p\ {n) } in Uf 1 and the 
probability distributions p^ = {pf 1 ^} (we use the notations p l B(n) = [pVn)] > P%m — 

Assuming that the input states p x A are pure, and denoting by p = pa = ^2 x PxPa 
the average state of the input ensemble, from fl 1 2 9 [) we obtain the key relation [155J 

I e (p,$)=S(*\p))-S(*\p)) 



x ({p x }, mp A m - x ({px}, m PA ]}) , (129) 
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where we used Eq. (11221) and the fact that the states p x BE = V p x A V^ are pure implying 
S{*\fi\) = HPb) = S{p%) = S(${p A }) , (130) 

for all x. The identity fl 1 2 9 [) provides the fundamental connection between the quantum 
and the secret classical capacities which underlies the proof of Eq. (11241) given in Ref . |44j . 
Here we only notice that since in the computation of C p (<&a-^be) one takes into account 
all ensembles (not just the pure ones for which Eq. dl29[) hold), we get the following 
inequality 

C P M>Q($) , (131) 

which in general can be strict - see e.g. Refs. [9H1 E7] - with the notable exception of 
degradable channels for which 

C p {$ a ^be) = Q{*) = Qi($). (132) 

To conclude this discussion of the wiretap channels we briefly mention the vast 
field of quantum cryptography which constitutes a self-consistent portion of quantum 
information science (for reviews relating this subject we refer to [63 1 1183] ). 

4.. 10. Capacities for Gaussian channels 

In this section we briefly discuss the classical and the quantum capacities for Gaussian 
channels: in particular we focus on the single-mode CPTP maps analyzed in Sec. 13.51 

When speaking of the classical capacities for continuous- variables systems, to obtain 
reasonable finite quantities, one should introduce an energy constraint onto input 
ensembles similarly to what was done for c-q Gaussian channel in Example 3 of Sec. 14.31 
On the other hand, for the quantum capacity, even though constraining the input may be 
reasonable from a practical point of view, this is not strictly necessary as Q($) remains 
finite even in the unbounded case. Therefore in what follows we define C X (<&,E) to be 
the value of ( 1951) where the maximum is taken over ensembles {p x , p x } with the average 
state p = ^2 x p x Px satisfying the energy constraint 

Tipa ] a < E , (133) 

(a and a) being the annihilation and creation operator of the mode). Analogously, the 
entanglement-assisted classical capacity C ea ($, E) is defined by the expression (11171) 
where the maximum taken over the states p satisfying Eq. f II 3 3 H . 

It is not known in general if C X ($>,E) is additive under taking tensor products of 
Gaussian channels which prevents from identifying C x (<b,E) with the full constrained 
capacity C(§,E). However the additivity property (1941) holds for entanglement-breaking 
channels satisfying ( 1721) . Another case where additivity of C X (<&,E) with the energy 
constraint was established is the case of pure loss channel, i.e. attenuator (k < 1) with 
Nq = (environment in the vacuum state) [58]. The actual computation of C X (3>,E) 
is in general also an open problem: there is a natural conjecture that the maximum in 
C X ($>,E) for a quantum Gaussian channel with quadratic energy constraint is attained 
on a Gaussian ensemble of pure Gaussian states [96J 11611 1191j . but so far this was only 
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established for c-q channels and the pure loss channel [5S]. It is also worth pointing 
out that this conjecture can be conveniently reformulated in terms of a property of the 
minimum output entropy [571 EHJ [55] . If the conjecture is true, then in the cases (i), (ii), 
and (iii) defined in Sec. 13.51 the optimal ensemble is the continuous ensemble formed by 
the coherent states |C)(CI distributed with a Gaussian density p(Q = ^ exp (— \(\ 2 /E), 
yielding 

C($,E)=g(N , (E))-g(N'(0)), (134) 

where N'(E) = k 2 E + max{0, k 2 — 1} + iVo is the mean number of quanta at the output 
when the input number of quanta is E, and where g(x) is defined as in Eq. (1881) . 

The computation of the entanglement-assisted capacity C ea (Q,E) is a relatively 
simpler problem as, on one hand, no regularization over multiple uses is required, and 
on the other hand, the quantity to be maximized (the quantum mutual information) is a 
concave function [96] which, even in the presence of the linear constraint (11331) . admits 
a regular method of solution. In particular, for a Gaussian channel the maximum is 
always attained on a Gaussian state which can be found as a solution of certain Kuhn- 
Tucker equations. The expression of C ea (§,E) for the one-mode channels was derived 
in Ref. [96J and generalized to the multimode case in Ref. [60] . Explicitly it is given by 

C ea ($, E) = g(E) + g(N'(E)) - g(D + (E)/2) - g(D-(E)/2) , (135) 

where g(x) and N'(E) as before and where D±(E) = D(E) + N'(E) ± E - 1, with 
D(E) = y/(E + N'{E) + l) 2 - 4k 2 E{E + 1). 

In general entanglement-breaking channels have zero quantum capacity Q(&) = 0. 
However in any case the domain (!72|) is superseded by the broader domain iVo > 
min(l,fc 2 ) — 1/2 where the channel is anti-degradable and hence has zero quantum 
capacity [32]. On the other hand, in the case Nq = 0, k 2 > 1/2 the channel 
is degradable [311 [32], hence the quantum capacity of the attenuation/amplification 
channel with Nq = and the coefficient k is equal to the maximized single-letter 
Gaussian coherent information 

Q($) = sup/ c (p,$) =max|o,log 2 p^| , (136) 

an expression conjectured in Ref. [96J and proved in Ref. |192j . The case with iVo > 
remains open question. Enforcing the energy constraint as in Eq. (11331) . the above 
expression is replaced by 

Q($,E) = max{0 1 g(N t (E))-g(D + (E)/2)-g(D.{E)/2)} . (137) 

A plot of the above quantities is presented in Fig. @] for the special case of the 
attenuation channel (|69|) with N = 0, for which the expressions simplify as follows 

C($,E) =g(k 2 E), 

C eQ ($, E) = g{E) + g(k 2 E) - g((l - k 2 )E) = 2Q ea ($, E) , 

Q($,E) = max{0,g(k 2 E) - g((l - k 2 )E)} . (138) 

46 




Figure 4. (Color online) Capacities (|1 38[) of the attenuation Bosonic Gaussian 
channel (|69|) for Nq = as a function of the attenuation parameter k 2 and for an 
energy constraint (|133[) with E = 10. Specifically: classical capacity C($ 7 E) [58] ( red 
curve); quantum capacity Q($,E) |192j (black curve); entanglement-assisted classical 
capacity C ea ($,E) [96] (blue curve), and entanglement- assisted quantum capacity 
Q ea ($,E) = C ea (&,E)/2 (green curve). Notice that for k = 1 (noiseless case), 
Q($,E) = C($,E) = Q ea ($,E). Also for k 2 < 1/2 the channel becomes anti- 
degradable [31] [32] and the quantum capacity vanishes. In the plot all the quantities 
have been rescaled by g(E) (i.e. the optimal (entanglement-unassisted) transmission 
rate achievable for transmissivity k = 1). 

Here Q ea (<3>, E) is the entanglement- assisted quantum capacity of the channel $ (i.e. the 
quantum capacity which is achievable when the sender and the receiver are provided 
with prior shared entanglement): from general results p2] it is known to be always 
equal to half of the corresponding classical entanglement-assisted capacity. We also 
stress that for N — the reported value (11381) for C($,E) is the exact value of the 
classical capacity [58] . 

4.. 11. The variety of quantum channel capacities 

The three capacities defined in Eqs. ((23}, ffTTTL (fT24Tl are related as Q($) < C($) < 
C ea ($) and form a basis for defining and investigating the diversity of various capacities 
of a quantum communication channel, which arises by the application of additional 
resources, such as reverse or direct communication, correlation, or entanglement. In 
classical information theory it is well known that feedback does not increase the Shannon 
capacity which is essentially the unique characteristic of the classical channel. In 
the quantum case similar property is established [22] for the entanglement-assisted 
capacity C ea ($). Regarding the quantum capacity it is known that it can not 

be increased with an additional unlimited forward classical communication [T71 [9]. 
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Figure 5. Schematic comparison between the capacities of a memoryless quantum 
channel $ (see text for the definitions of the various quantities). The vertical 
"inequalities" of the scheme follow from the fact that each transferred qubit of 
information can carry a classical bit (e.g. see also Fig. HJ). 

However, Q(<&) can be increased if there is a possibility of transmitting the classical 
information in the backward direction. Such a protocol would allow one to create 
the maximum entanglement between the input and output, which can be used for 
quantum state teleportation. By this trick, even channels with zero quantum capacity 
supplemented with a classical feedback can be used for the reliable transmission of 
quantum information |135[ 122] (a notable exception is PPT channels which have 
null quantum capacity even in the presence of the feedback). Furthermore Smith 
and Yard |171j recently provided an explicit example of an interesting phenomenon 
named superactivation: there exist cases in which, given two quantum channels $1, $2 
with zero quantum capacity (Q($i) = Q(&2) = 0), it is possible to have Q($i <8> $2) > 
(the latter being the capacity of the communication line $!®$2 obtained by using jointly 
$1 and $2)- The example of |171] was build by joining an anti-degradable channel $1 
with a PPT channel $2, and a more general construction was given in Ref. [24J. 

A comparison of the various capacities which arise from employing additional 
resources is presented in Fig. El where the symbol "<" should be understood as "less 
than or equal to for all channels and strictly less for some channels" [12]. In such 
scheme Q^. and C<_ denote respectively the quantum and classical capacities in the 
presence of a feedback. The symbol represents the quantum capacity in the 
presence of two-side classical communication [14J. The corresponding classical capacity 
C is computed under the limitation that the side communication is independent 
of the message transmitted through the main channel [12]. It is also known that 
C ea = 2Q ea [17] and that for some other pairs both inequalities are possible. Further, one 
can construct so-called "mother" protocol which can implement all possible methods of 
transmission, including those mentioned above when using various additional resources 
(such such as feedback or entanglement) pQ. We finally point out that of all the 
quantities entering in scheme of Fig. [5j the function defined by the Eq. C I125[) 

is the only one which has no clear operational definition in terms of information rate 
(apart from the cases in which it coincides with the quantum capacity - e.g. for the 
degradable channels). Still we have inserted it in the list as it provides a lower bound 
for Q(<&). Notice also that C x (<3>) corresponds to the classical capacity of the channel $ 
restricted to separable encodings. 

In the classical information theory the role of the Shannon capacity is twofold: 
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the converse statement of the coding theorem gives a fundamental upper bound on 
the channel performance which serves as an important benchmark for real information 
processing system. This is also the case for the present state of art with quantum 
channel capacities. On the other hand, the direct statement of the coding theorem 
asserts that the bound is asymptotically achievable, although the proof does not give a 
practical recipe. In fact for almost 50 years after the Shannon proof the real performance 
was well below the theoretical bound and only more recently efficient practical codes 
appeared with rates approaching the capacity. This is still to be done in the quantum 
case, and the existing quantum error-correcting codes are the first promising steps in 
that direction. As an important example, in the papers j67J [73] symplectic codes were 
successfully applied to demonstrate constructively achievable rates close to the capacity 
for the additive classical noise channel ( 17TT) . 



4- 12. Relative entropy 

Before concluding our survey of quantum channel capacities and their entropic 
expressions, it is useful to consider a fundamental quantity - the quantum relative 
entropy - which underlies many information characteristics. Given two density operators 
p, a in % , one defines the relative entropy of p with respect to a as 

S(p\\a) =Tip(\ogp -log a) . (139) 

This quantity provides an important (asymmetric) measure of distinguishability of the 
states p, a; it is nonnegative and is equal to zero if and only if p = a [1841 1138) . 

The most important property of the relative entropy which underlies several 
key facts in quantum information theory and nonequilibrium statistical mechanics is 
monotonicity under the action of quantum channels 

S(p\\a) > S($\p]\\$[a]) . (140) 

In words, this means that states become less distinguishable after an irreversible 
evolution $. The first proof of the monotonicity property is due to Lindblad [122] 
who derived it from the strong subadditivity of quantum entropy, Eq. (11141) . 

To stress the relevance of the relative entropy in the context of quantum 
information, let us first notice that the quantum mutual information (110 II) can be written 
as 

J(p,$) = S{p BR \\p B ® p R ) , (141) 

implying J(p, $) > unless pre = Pr® Pb when I(p, $) = (as already mentioned in 
Sec. I4.5p . and the data processing inequality (11161) as a particular instance of Eq. ( 1140ft . 

Next, the ^-information (179"]) also admits a representation in terms of S(p\\a), 
namely [156J 

X ({Px}, {Px}) = ^Px S (p x \\ ^iV/v) , (142) 
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From this expression one can invoke the monotonicity property (I140p to imply the 
following data processing inequality 

X ({p x }, {$(p*)» ^ X ({Px}, {p,}) , (143) 

which in particular yields the bound (I80jl when $ is taken to be the q-c channel 
transforming quantum states into probability distributions (in this case the ^-function 
on the left hand side coincides with the Shannon information of the measurement process 
associated with the q-c mapping). 

The inequality (I143P has also profound implications for the coherent information. 
Applying it to the identity (11291) we obtain that if the channel $ is anti-degradable, 
then for any state p 

/ c (p,$)<0, (144) 

(correspondingly, for degradable channel I c (p, $) > 0). From the coding theorem (I124p it 
then follows that all anti-degradable channels have zero quantum capacity as anticipated 
at the end of Sec. 14.81 

Another useful application of the monotonicity ( 11401) concerns what may be called 
the generalized H-theorem which states that a bistochastic (unital) evolution does not 
decrease the entropy. In other words, the H-theorem implies that given a unital channel 
$ and arbitrary input state p, one has 

S{*\p]) > S(p) . (145) 
To see this in finite dimensional case (dim% = d < oo), use the identity 

S(p) = \ogd-S(p\\I/d), (146) 
where I jd is the chaotic state, and notice that 

sm P ]) = \o g d-sm P }\\i/d) = hgd-sm P ]\mi/d}) 

^ \ogd-S(p\\I/d) = S(p) . (147) 

where Eq. (I140p and the fact that <&[I/d] = I jd for bistochastic channels was used. 

For a general channel $ : ©("H) — > &(%) an interesting characteristic is the minimal 
entropy gain defined by the quantity 

G($)=inf \s(S\p\)-S{p)} , (148) 

PL J 

In contrasts to nonadditivity of the other similar quantity - the minimal output entropy 
infp S^f&fp]) - which we introduced at the end of Sec. 14.41 the quantity G($) is additive 
with respect to tensor product of channels (i.e. G($i <S> $2) = G($i) + G(&2)) as a 
simple consequence of the strong subadditivity (I114p . see [3]. For finite dimensional 
system it is easy to see that 

- log 2 d < < , (149) 

(this follows directly from the fact that von Neumann entropies are upperbounded by 
log 2 c?). However, as shown in [91], there is much better lower estimate 

-log 2 ||$[I]||<G($). (150) 
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which holds also for infinite dimensional systems assuming that the channel $ is such 
that = Yl^jLiVjVj i s a bounded operator. This implies that the generalized H- 
theorem is valid also for infinite dimensional unital evolutions $ if one restricts to the 
input states p with finite entropy. The inequality G(§) < no longer holds; for example, 
if $ is Bosonic Gaussian channel with parameters (K, I, /3), det K ^ (see Sec. 13.41) then 
the minimal entropy gain is equal to 

G($) = l g 2 | det AT| , (151) 

and is attained on Gaussian states [93] . The quantity | det-fTj is the coefficient of the 
change of the classical phase space volume under the linear transformation K (which can 
take arbitrary positive values). These results also suggest that for a general irreversible 
quantum evolution the role of this coefficient is played by the quantity 

5. Summary and outlook 

In this review we considered evolutions of open quantum systems from a quantum 
information viewpoint. More specifically, we discussed several scenarios where a sender 
and a receiver establish communication line by using some physical degrees of freedom 
(the information carrier) subject to quantum noise from the environment. The resulting 
transformation of sender's input states into receiver's output states - the quantum 
channel in the Schrodinger picture- is described by a completely positive trace preserving 
(CPTP) map. Several alternative representations of such maps were introduced and 
their main features were analyzed. In particular we have reviewed the operator-sum 
(Kraus) and unitary/isometric representations of a quantum channel, as well as its 
characterization in terms of the corresponding Choi-Jamiolkowski state. Composition 
rules along with the notions of dual (Heisenberg picture) and complementary channels 
have been presented. The general treatment was supplied with discussion of important 
particular cases of qubit, depolarizing, erasure channels as well as entanglement-breaking 
and dephasing channels. A whole section was dedicated to Bosonic Gaussian channels 
which constitute a basic class of information processors for continuous variable systems 
and provide a representation for some of the most usable quantum communication 
protocols. 

In the second part of the review we surveyed approaches to evaluation of the 
quality of a given quantum channel, basing on the notions of channel capacities and 
their entropic expressions. For the sake of clarity, we restricted the analysis to the 
basic case of memoryless communication models, where the noise operates identically 
and independently on each of the sender's inputs. We started with recollections of the 
Shannon entropy of a classical random source and the Shannon capacity of a classical 
channel (relevant for a communication line where all the stages of the information 
transferring - coding, transmission, decoding - are treated in terms of classical 
random processes), explaining how these quantities acquire operational meaning in the 
asymptotic of very long messages. Moving to the quantum domain, we demonstrated 



51 



how these quantities admit multiple generalizations. Several different notions of channel 
capacities have to be introduced in view of the fact that a quantum communication 
line can be used to transfer either classical or quantum messages, and assisted by 
various additional resources such as entanglement shared between the communicating 
parties or the classical feedback, which in the Shannon theory either do not exist 
(entanglement) or do not increase the capacity (feedback). These quantities have 
operational definitions generalizing that of the Shannon capacity and admit closed 
expressions in entropic terms given by fundamental quantum coding theorems. The 
quantum correlations (entanglement) display themselves in the increase of information 
transmission rates as compared to protocols without entanglement. The notable cases 
of this phenomenon discussed in our paper include the superadditivity of the classical 
information transmission rates with respect to entangled decodings and encodings, the 
gain of the input-output entanglement assistance and superactivation of zero quantum 
channel capacity. 

Information-theoretic view opens thus a completely new perspective of quantum 
irreversible evolutions - noisy communication channels - and enlightens fascinating 
landscape of the channel entropic characteristics, in which entanglement plays a crucial 
role. 
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